Preface



THESE LECTURES were given during a seven-week course at the Tata
Institute of Fundamental Research. The aim was to provide an
introduction to modern sieve methods, i.e. to various forms of both the
large sieve (part I) and the small sieve (part II), as well as their
interconnections and applications. Being aware of the fact that such a
goal cannot be reached in such a short time, I have tried to compromise
between an introduction and a survey. The difficult task of deciding
what to omit I have tried to overcome in most cases by presenting the
simplest approach in detail and a sketch of the more sophisticated
results if their proof would have required too much time. Nevertheless
I have decided to include a chapter on the history of the large sieve up
to Bombieri's first paper, because I believe that a student coming to a
new branch of mathematics can learn much more from the historical
development than is generally expected. The final chapter contains a
proof of Chen's Theorem, because I consider it the most beautiful
example of the interaction between various sieve methods and other
powerful tools of analytic number theory.

I am indebted to my colleagues at the Tata Institute for their gener- 
ous hospitality, particularly to K.G. Ramanathan and to K. Ramachandra 
for many interesting discussions. 

The notes have been prepared by S. Srinivasan. His critical ability
has been of great value to me, and I wish to thank him for his
meticulous handling of the manuscript.

H.-E. Richert 

Bombay, April 1976 
Introduction



SIEVE METHODS, beyond that of Eratosthenes and of Legendre, can
be considered to have started with the works of Brun (small sieve) and
of Linnik (large sieve). In the first part of these lectures we confine
ourselves to an introduction to the large sieve and a survey of its
applications. In Chapter 0 we give a historical introduction to the
theory of the large sieve pertaining to the works, up to the first paper
of Bombieri (1965), covering a period of twenty-five years.

Regarding the relative powers of elementary sieve methods and the 
analytical methods one usually considers that the latter should be more 
powerful. Further it has generally held that large sieve is than the small 
sieve (also that Selberg's sieve always supersedes Brun's sieve). But 
history has shown that such views are not totally correct. 

As for the first point we elucidate the connection between the ele- 
mentary large sieve method and the analytical methods in number theory 
by recalling briefly some of those basic methods used in number theory. 

1) In multiplicative number theory one has the important problem of
finding asymptotic formulae for the sums $\sum_{n \le x} a_n$ as
$x \to \infty$, where the $a_n$ are values of some number-theoretic
function. Introducing the function
$F(s) = \sum_{n=1}^{\infty} a_n n^{-s}$, $s \in \mathbb{C}$, the
connection with analytical methods is brought about through the
following formula, due to Dirichlet:

$$\sum_{n \le x} a_n = \frac{1}{2\pi i} \int_{\operatorname{Re} s = c} F(s) \frac{x^s}{s}\, ds \qquad (x \notin \mathbb{Z}),$$

where c is a real number exceeding the abscissa of convergence (finite
in practice) of F(s). (Here and in what follows integrations along
straight lines are always in the direction of non-decreasing real and
imaginary parts.) To clarify the use of this formula let us take
$F(s) = \zeta^2(s)$, which corresponds to the problem of finding the
asymptotic formula (with error term) for $\sum_{n \le x} d(n)$. In this
case one can take for c above any value greater than 1. One can show
now that the major contribution to the above integral comes from the
part $|\operatorname{Im} s| \le T$, provided T is suitably large in
relation to x (we also choose c sufficiently close to 1). Next, shifting
the line of integration to the left, one has that the remaining (major)
part is, by Cauchy's theorem,

$$= 2\pi i R + \Big( \int_{\substack{\operatorname{Re} s = \xi \\ |\operatorname{Im} s| \le T}} + \int_{\substack{\xi \le \operatorname{Re} s \le c \\ \operatorname{Im} s = T}} - \int_{\substack{\xi \le \operatorname{Re} s \le c \\ \operatorname{Im} s = -T}} \Big) F(s) \frac{x^s}{s}\, ds,$$

where $0 < \xi < 1$ and R denotes the sum of residues of the integrand
at its poles within the rectangle bounded by the lines
$|\operatorname{Im} s| = T$, $\operatorname{Re} s = \xi$ and
$\operatorname{Re} s = c$. It turns out that the estimate for the first
integral above dominates those of the other two and is itself
'negligible' provided T is not too large in relation to x; in other
words, if T is appropriately chosen, then R is the main term of the
asymptotic formula for $\sum_{n \le x} d(n)$, $x \to \infty$. Thus, in
this case, we are led to the following

$$\sum_{n \le x} d(n) = x \log x + c_0 x + O(x^{\theta}), \qquad x \to \infty,$$

with some constants $c_0$ and $\theta$, $0 < \theta < 1$. (Clearly, the
restriction that x is not an integer can easily be dropped here.)
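The shape of this asymptotic formula can be checked numerically. The following is only a sketch: it assumes the classical value $c_0 = 2\gamma - 1$ (not stated above) and the classical fact that the error is $O(\sqrt{x})$, and the function names are chosen here purely for illustration.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant (gamma)


def divisor_sum(x: int) -> int:
    """Return sum_{n <= x} d(n), using sum_{n <= x} d(n) = sum_{k <= x} floor(x / k)."""
    return sum(x // k for k in range(1, x + 1))


def main_term(x: float) -> float:
    """x log x + c_0 x, with the assumed classical value c_0 = 2*gamma - 1."""
    return x * math.log(x) + (2 * EULER_GAMMA - 1) * x


def error_term(x: int) -> float:
    """Exact divisor sum minus the main term."""
    return divisor_sum(x) - main_term(x)


if __name__ == "__main__":
    for x in (100, 1000, 10000):
        # Columns: x, exact sum, error, sqrt(x) for comparison.
        print(x, divisor_sum(x), round(error_term(x), 2), round(math.sqrt(x), 2))
```

For the sample values of x the error stays well below $\sqrt{x}$, in line with an exponent $\theta < 1$.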

2) As to additive number theory we again consider a simple case only.
Let $\mathcal{Y}$ be an (infinite) set of non-negative integers and let
$a_n$ be the characteristic function of $\mathcal{Y}$; i.e.,

$$a_n = \begin{cases} 1 & \text{if } n \in \mathcal{Y}, \\ 0 & \text{if } n \notin \mathcal{Y}. \end{cases}$$

Introducing the function

$$F(z) = \sum_{n=0}^{\infty} a_n z^n$$

(so that the power series has radius of convergence = 1) the formula

$$\frac{1}{2\pi i} \int_{|z| = r} F^2(z)\, z^{-N-1}\, dz = \sum_{n_1 + n_2 = N} a_{n_1} a_{n_2} \qquad (0 < r < 1)$$

provides the analytical connection with the problem of finding (the
asymptotic formula for) the number of representations of N as a sum of
two numbers of $\mathcal{Y}$. For instance, when $\mathcal{Y}$ is the
set of primes this corresponds to the well-known Goldbach problem. Then
it turns out that the major contribution to the above integral comes
from the points z with arguments 'close' to fractions with 'small'
denominators, while r approaches 1 as $N \to \infty$; the set of such
points z constitutes the 'major arcs' and the remaining parts are termed
'minor arcs'. Thus again we see that to get information about our
problem one has to move close to the singularities (on the unit circle)
of our function.
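The right-hand side of this formula is simply the N-th coefficient of $F^2(z)$, so for small N it can be computed directly. The sketch below does this for the set of primes; the helper names are hypothetical, and representations are counted as ordered pairs $(n_1, n_2)$.

```python
def primes_up_to(n: int) -> list:
    """Sieve of Eratosthenes: all primes <= n."""
    if n < 2:
        return []
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, int(n ** 0.5) + 1):
        if is_prime[i]:
            for j in range(i * i, n + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def characteristic(ys, size: int) -> list:
    """a_n for 0 <= n < size: 1 if n belongs to ys, else 0."""
    members = set(ys)
    return [1 if n in members else 0 for n in range(size)]


def square_coefficients(a: list) -> list:
    """Coefficients of F(z)^2 up to degree len(a) - 1, where F(z) = sum a[n] z^n."""
    size = len(a)
    r = [0] * size
    for n1 in range(size):
        if a[n1]:
            for n2 in range(size - n1):
                r[n1 + n2] += a[n1] * a[n2]
    return r


# Ordered representations of N as a sum of two primes, for N <= 100.
a = characteristic(primes_up_to(100), 101)
r = square_coefficients(a)
```

Here `r[N]` counts the ordered pairs of primes with sum N; for instance `r[10]` counts 3 + 7, 7 + 3 and 5 + 5.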

The functions introduced in 1) and 2) above are particular instances
of general Dirichlet series

$$\sum_{n} a_n e^{-\lambda_n s}, \qquad a_n \in \mathbb{C},\ s \in \mathbb{C},$$

where $\lambda_1 < \lambda_2 < \lambda_3 < \cdots \to \infty$. Still
the above problems have this essential difference: under 1) we
encountered isolated singularities and on the other hand, regarding 2),
one knows from gap theorems that, for example, for the function
$\sum_{p} z^p$ the unit circle is the natural boundary. However, both
the cases illustrate the principle of the analytical methods in that the
singularities of the associated function are the sources of
arithmetical information regarding the concerned problem, in the sense
that the heavier the singularity, the greater is its contribution to the
main term.
The idea of the second of the methods sketched above, the
'Hardy-Littlewood method', goes back to Hardy and Ramanujan. It was
later developed in a series of papers by Hardy and Littlewood. (A
variant of this method, introduced by I. M. Vinogradov, which uses
finite sums instead of series, allows integration over the unit circle.)
In this method, the aforementioned principle is reflected in that the
contribution of the major arcs (i.e., neighbourhoods of heavier
singularities), the so-called 'singular series' of the problem,
determines the main term.

We are now in a position to indicate how the 'elementary' large sieve
method can be regarded as being analogous to the corresponding
analytical approach. In its basic form the large sieve relates the
mean-square contribution from the mid-points of the major arcs (the
sources of arithmetical information) of the size of the associated
function with the mean-square integral, similar to the singular series
(above) being related to the integral over the unit circle. Thus the
method links up an arithmetical information with the gross mean-square
(= number of elements in $\mathcal{Y}$, if $a_n$ is the characteristic
function of $\mathcal{Y}$). This is to suggest that this 'elementary'
sieve method can be considered analogous to analytical methods.



Notation 



In general all the notation employed in these lectures is either
standard or given explicitly at the place of its first occurrence. So we
limit ourselves here to a description of the former type followed by the
ones of the other kind (along with the place of their first occurrence
in parentheses, for the convenience of reference). A reference (A.B) to
a part in these lectures stands for 'formula B of Chapter A'.

The letter p (with or without affixes) denotes invariably a prime
number. An 'almost prime' $P_r$ (cf. (12.8)), for a given integer
$r \ge 1$, is a natural number with not more than r prime factors
(counted with multiplicity). The greatest common divisor of two
integers m and n is denoted by (m, n). For an integer n, the divisor
function d(n) (cf. p. ii) denotes the number of (positive) divisors of
n. We also use von Mangoldt's function $\Lambda(n)$ (cf. (1.75)) defined
as log p or 0 according as n is a power of (some prime) p or not. As
usual, for a real x, [x] denotes the greatest integer not exceeding x.
Euler's constant $\gamma$ (cf. (8.33)) is
$\lim_{x \to \infty} \big( \sum_{1 \le n \le x} \frac{1}{n} - \log x \big)$.

The order notations $O$, $o$, $\ll$, $\gg$ have their customary meaning
and the dependence of the implied constant on some auxiliary
parameter(s) is (when essential) given explicitly. (The notation
$\asymp$, meaning both $\ll$ and $\gg$, occurs only once in connection
with (0.51).)

The symbols $\mathbb{N}$, $\mathbb{Z}$, $\mathbb{R}$, $\mathbb{C}$
denote respectively the sets of natural numbers, integers, real numbers,
complex numbers endowed with their natural (basic) structures. Regarding
intervals (of reals or integers) the convention of using the brackets
), ( to indicate the excluded end-point(s) and ], [ to indicate the
included end-point(s) is adopted. Also we use $|\mathcal{A}|$ to denote
the cardinality of a finite set (or sequence) $\mathcal{A}$.

Finally, the notation explicitly introduced in the lectures: We have
$\|x\|$ (cf. (2.10)), $e(u)$ (cf. (0.52)). The number-theoretic
functions $\nu(n)$ (cf. (1.9)), $\mu(n)$ (cf. (1.8)), ^(n) (cf. (1.19)),
$c_q(n)$ (cf. (1.29)), ^(n) (cf. (1.33)) and r(«) (cf. (6.71)) occur
more than once. Also for the summation conventions
($\sum_{\ell=1}^{q}{}^{*}$, $\sum_{\ell \bmod q}^{*}$) and
$\sum_{\chi \bmod q}^{*}$ see respectively (1.29) and (3.6). The
following conditions are of repeated use in the second part of these
lectures:

$(\Omega_1)$ (cf. (9.16a) or (9.16b)); $(\Omega_2(\kappa, L))$ (cf. (11.3)); $(\Omega_2(\kappa))$ (cf. (11.4));
$(R)$ (cf. (11.9)); $(\Omega_0)$ (cf. (11.11)); $(Q)$ (cf. (11.23)); $(R(1, \alpha))$ (cf. (11.62)).
SIEVE METHODS



Chapter 0

History of the Large Sieve



LET $\mathcal{Y}$ BE a set of $|\mathcal{Y}|$ integers contained in an
interval of length N:

$$\mathcal{Y} \subset (M, M+N], \quad M \in \mathbb{Z},\ N \in \mathbb{N},\ S := |\mathcal{Y}|. \tag{0.1}$$

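Setting

$$S(q, \ell) := \sum_{\substack{n \in \mathcal{Y} \\ n \equiv \ell \bmod q}} 1, \tag{0.2}$$

we have

$$\sum_{\ell=1}^{q} S(q, \ell) = S, \tag{0.3}$$

or that

$$\sum_{\ell=1}^{q} \Big( S(q, \ell) - \frac{S}{q} \Big) = 0. \tag{0.4}$$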

Confining, for the moment, our attention to primes q = p only, (0.4)
tells us that the quantity

$$D(p) := \sum_{\ell=1}^{p} \Big( S(p, \ell) - \frac{S}{p} \Big)^2 \tag{0.5}$$

measures how uniformly the set $\mathcal{Y}$ is distributed among the
residue classes mod p. Such information is of great importance in
various problems.
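To make the meaning of D(p) concrete, here is a minimal sketch computing $S(p, \ell)$ and $D(p)$ with exact rational arithmetic. The function names and the two sample sets (the squares up to 100, and the integers 1 to 9) are illustrative choices only.

```python
from fractions import Fraction


def residue_counts(ys, q: int) -> list:
    """S(q, l) for all classes l mod q; index l % q (the class l = q is index 0)."""
    counts = [0] * q
    for n in ys:
        counts[n % q] += 1
    return counts


def D(ys, p: int) -> Fraction:
    """D(p) = sum over l mod p of (S(p, l) - S/p)^2, computed exactly."""
    S = len(ys)
    return sum((Fraction(c) - Fraction(S, p)) ** 2 for c in residue_counts(ys, p))


squares = [k * k for k in range(1, 11)]   # badly distributed mod 3: class 2 is empty
evenly_spread = list(range(1, 10))        # three elements in each class mod 3
```

The squares never fall into the class 2 mod 3, so `D(squares, 3)` is large, whereas `D(evenly_spread, 3)` vanishes.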






A uniform and non-trivial bound of the form

$$(D)\qquad \sum_{p \le Q} p D(p) \le K(N, Q, S) \tag{0.6}$$

(by uniform we mean here that while K may depend on N, Q and S, it
should be independent of the particular structure of the set
$\mathcal{Y}$) makes it possible to draw the following general
conclusions. (Note that actually the supposition M = 0 here involves no
loss of generality.)

(A) Let N, Q and S be given. If every set $\mathcal{Y}$ (cf. (0.1)) is
so uniformly distributed over the residue classes mod p as expressed by
(D), then for most of the p's D(p) must be small. Since D(p) is bound
to be large if many residue classes mod p do not contain any element of
$\mathcal{Y}$, this remark tells us that the statement
$S(p, \ell) = 0$ for 'many' $\ell$'s mod p can be true for only 'few'
('exceptional') primes $p \le Q$.



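We can express this remark in a quantitative form. Let $\omega(p)$ be
an (integer-valued) function satisfying

$$0 < \omega(p) < p, \tag{0.7}$$

and now we ask for the number of p's $\le Q$, for which at least
$\omega(p)$ residue classes mod p do not contain any element of our set
$\mathcal{Y}$. Let us denote the set of these 'exceptional' primes by
$\mathcal{P}$, and set

$$\min_{p \in \mathcal{P}} \frac{\omega(p)}{p} = \delta = \delta(\mathcal{P}). \tag{0.8}$$

One has then for each $p \in \mathcal{P}$

$$p D(p) \ge p\, \omega(p) \frac{S^2}{p^2} \ge \delta S^2, \tag{0.9}$$

and so

$$\sum_{p \in \mathcal{P}} p D(p) \ge \delta S^2 |\mathcal{P}|. \tag{0.10}$$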
Trivially

$$\sum_{p \in \mathcal{P}} p D(p) \le \sum_{p \le Q} p D(p), \tag{0.11}$$

and therefore (0.6) and (0.10) give

$$(A)\qquad |\mathcal{P}| \le \frac{K(N, Q, S)}{\delta S^2}. \tag{0.12}$$

For the remaining primes, i.e., $p \notin \mathcal{P}$ and $p \le Q$,
fewer than $\omega(p)$ residue classes mod p are devoid of numbers of
$\mathcal{Y}$. Consequently, for these primes p each of at least
$p - \omega(p)$ residue classes mod p contains at least one element of
$\mathcal{Y}$.

(B) The preceding result may also be considered as a sieve problem.
In order to see this, let us start (cf. (0.1)) with the set of numbers

$$M + 1, \ldots, M + N, \qquad M \in \mathbb{Z},\ N \in \mathbb{N}. \tag{0.13}$$

Now for certain primes $p \le Q$, $p \in \mathcal{P}$ say, strike out
of the numbers (0.13) all those numbers which are situated in any of
certain $\omega(p)$, where $\omega(p)$ satisfies (0.7), of the p residue
classes mod p. Let the remaining set of numbers be our set
$\mathcal{Y}$. We obtain (0.12) again under the present situation, since
we have used for its proof only that at least $\omega(p)$ residue
classes mod p (for each $p \in \mathcal{P}$) contain no element of
$\mathcal{Y}$. Next, resolving (0.12) with respect to S (as shall be
seen later to be possible) we get an upper bound for $|\mathcal{Y}|$
with respect to our set $\mathcal{Y}$ above. It is of the type

$$(B)\qquad (|\mathcal{Y}| =)\ S \le K_1(N, Q, \mathcal{P}, \delta). \tag{0.14}$$

(C) Our remark at the beginning of (A) had been that a non-trivial
estimate of the type (D) implies that D(p) is small for most of the p's
under consideration. If we view this as the statement that 'most often'
$\frac{S}{p}$ is a good approximation to $S(p, \ell)$, then its
quantitative version leads to a more precise formulation than that
under (A) and consequently than that under (B). In fact, we obtain a
result of the type of Cebysev's inequality.
To this end we introduce a function c(p) satisfying

$$c(p) \ge 1 \tag{0.15}$$

and put

$$\max_{p \le Q} c(p) = c = c_Q. \tag{0.16}$$

Now we ask for the validity of the inequality

$$\Big| S(p, \ell) - \frac{S}{p} \Big| < \frac{S}{p\, c(p)}. \tag{0.17}$$

More precisely, we ask for the number of primes $p \le Q$ for which
the inequality (0.17) does not hold for at least $\omega(p)$ (cf. (0.7))
residue classes $\ell$ mod p. Let us denote again by $\mathcal{P}$ the
set of such exceptional primes. Then, in the notation of (0.8),

$$p D(p) \ge p\, \omega(p) \frac{S^2}{p^2 c^2(p)} \ge \frac{\delta}{c^2} S^2, \qquad \forall p \in \mathcal{P}. \tag{0.18}$$

Consequently, in view of (0.11), (0.6) gives for the number of
exceptional primes p the upper bound

$$(C)\qquad |\mathcal{P}| \le \frac{c^2}{\delta S^2} K(N, Q, S). \tag{0.19}$$

This result may be phrased as follows: (0.6) implies that for every
set $\mathcal{Y}$ (of (0.1)) one has for all $p \le Q$, save for at most

$$\frac{c^2}{\delta S^2} K(N, Q, S)$$

primes, and for all residue classes $\ell$ mod p with the exception of
fewer than $\omega(p)$ of them,

$$S(p, \ell) = \frac{S}{p} + \theta \frac{S}{p\, c(p)}, \qquad \text{where } |\theta| < 1. \tag{0.20}$$

Choosing c(p) = 1, we see that each exceptional prime of (A) is also an
exceptional prime of (C), and conversely. Therefore (C) is a
generalization of (A) and hence also of (B).
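The chain (0.9) to (0.12) can be traced on a small example of the sieve situation (B). The sketch below is only an illustration under assumptions made here: the interval (0, 200], the struck classes 0 and 1 modulo 5, 7 and 11, and the choice $\omega(2) = 1$, $\omega(p) = 2$ otherwise are all arbitrary, and the role of K is played by the actual value of the sum in (0.6).

```python
from fractions import Fraction


def primes_up_to(n: int) -> list:
    """All primes <= n by trial division (adequate for small n)."""
    return [p for p in range(2, n + 1) if all(p % d for d in range(2, int(p ** 0.5) + 1))]


def residue_counts(ys, p: int) -> list:
    """S(p, l) for every residue class l mod p."""
    counts = [0] * p
    for n in ys:
        counts[n % p] += 1
    return counts


def p_times_D(ys, p: int) -> Fraction:
    """p * D(p) with D(p) as in (0.5), computed exactly."""
    S = len(ys)
    return p * sum((Fraction(c) - Fraction(S, p)) ** 2 for c in residue_counts(ys, p))


def exceptional_primes(ys, Q: int, omega) -> list:
    """Primes p <= Q for which at least omega(p) residue classes contain no element of ys."""
    return [p for p in primes_up_to(Q) if residue_counts(ys, p).count(0) >= omega(p)]


def bound_A_holds(ys, Q: int, omega) -> bool:
    """Check delta * S^2 * |P| <= sum_{p <= Q} p D(p), i.e. (0.10) combined with (0.11)."""
    exc = exceptional_primes(ys, Q, omega)
    if not exc:
        return True
    S = len(ys)
    delta = min(Fraction(omega(p), p) for p in exc)
    total = sum(p_times_D(ys, p) for p in primes_up_to(Q))
    return delta * S * S * len(exc) <= total


def omega(p: int) -> int:
    """Illustrative choice satisfying 0 < omega(p) < p."""
    return 1 if p == 2 else 2


# Sieve (B): strike the classes 0 and 1 modulo 5, 7 and 11 out of 1, ..., 200.
N = 200
struck = {5: (0, 1), 7: (0, 1), 11: (0, 1)}
ys = [n for n in range(1, N + 1)
      if all(n % p not in classes for p, classes in struck.items())]
```

With Q = 13 the exceptional primes found are exactly the sieving primes 5, 7 and 11, and `bound_A_holds(ys, 13, omega)` confirms the inequality behind (A).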



Although the uniformity of (0.6) with respect to $\mathcal{Y}$ may be
considered a defect (because it includes all 'bad' sets) it has the
advantage of making all the conclusions (of which we have given three
general examples in (A), (B) and (C)) valid for all sets, including
those which are otherwise not readily accessible. The quality of the
K's known for (0.6) allows us to obtain results which are not available
by the use of other methods.

(0.6) is of interest for $Q \le N$ only. A trivial estimate is
obtained in the following way. By (0.5) and (0.3) we have

$$D(p) = \sum_{\ell=1}^{p} S^2(p, \ell) - \frac{S^2}{p}, \tag{0.21}$$

and trivially

$$S(p, \ell) \le \sum_{\substack{M < n \le M+N \\ n \equiv \ell \bmod p}} 1 \le \frac{N}{p} + 1 \le 2\frac{N}{p} \qquad (p \le N). \tag{0.22}$$

Using this and (0.3) again in (0.21) we find that

$$p D(p) \le 2N \sum_{\ell=1}^{p} S(p, \ell) = 2NS, \tag{0.23}$$

and so

$$(D_0)\qquad \sum_{p \le Q} p D(p) \le 2NQS \qquad (Q \le N). \tag{0.24}$$
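The trivial estimate holds for every set in the interval, which makes it easy to test on arbitrary data. The following sketch checks (0.23) and (0.24) for a pseudo-random set and for a deliberately badly distributed one; the interval, the sample size and the seed are arbitrary choices made here.

```python
import random
from fractions import Fraction


def primes_up_to(n: int) -> list:
    """All primes <= n by trial division (adequate for small n)."""
    return [p for p in range(2, n + 1) if all(p % d for d in range(2, int(p ** 0.5) + 1))]


def p_times_D(ys, p: int) -> Fraction:
    """p * D(p), using D(p) = sum_l S(p, l)^2 - S^2 / p as in (0.21)."""
    S = len(ys)
    counts = [0] * p
    for n in ys:
        counts[n % p] += 1
    return p * (sum(c * c for c in counts) - Fraction(S * S, p))


def trivial_bound_holds(ys, N: int, Q: int) -> bool:
    """Check (0.23) for each prime p <= Q and (0.24) for the whole sum; assumes Q <= N."""
    S = len(ys)
    primes = primes_up_to(Q)
    each = all(p_times_D(ys, p) <= 2 * N * S for p in primes)
    total = sum(p_times_D(ys, p) for p in primes) <= 2 * N * Q * S
    return each and total


M, N = 1000, 300
rng = random.Random(0)                                   # fixed seed for reproducibility
random_set = rng.sample(range(M + 1, M + N + 1), 80)     # 80 distinct integers in (M, M+N]
multiples_of_7 = [n for n in range(M + 1, M + N + 1) if n % 7 == 0]
```

Both `trivial_bound_holds(random_set, N, 50)` and `trivial_bound_holds(multiples_of_7, N, 50)` hold, as they must for any set contained in the interval.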

As we have seen above (A), (B) and (C) are more or less different
versions of general types of results based on (D). Linnik [1] was the
first to consider such problems. With respect to (B) he made the remark
that when striking out an (absolutely) bounded number of residue
classes mod p the sieve method of Brun (or of Selberg) is applicable.
However, this is no longer true if, as is permissible under (B),
$\omega(p)$ is, for example, an increasing function of p. For this very
reason, Linnik named his method of treating (A) and (B) "The Large
Sieve".

Linnik [1] proved that

$$(A_1)\qquad |\mathcal{P}| \le 20\pi \frac{N}{\delta^2 S} \quad \text{for } Q = \sqrt{N}, \tag{0.25}$$

and consequently

$$(B_1)\qquad S \le 20\pi \frac{N}{\delta^2 |\mathcal{P}|} \quad \text{for } Q = \sqrt{N}. \tag{0.26}$$

As an example he takes $\mathcal{Y}$ to be the set of primes $\le N$,
i.e., M = 0 and $S = \pi(N)$, and $\omega(p) = p^{3/4}$. Then
$\delta \ge N^{-1/8}$ and $(A_1)$ yields the non-trivial estimate:

$$|\mathcal{P}| \le 20\pi \frac{N^{5/4}}{\pi(N)} < 80 N^{1/4} \log N, \quad \text{for } N \ge N_0. \tag{0.27}$$

Although the large sieve first occurred in the forms $(A_1)$ and
$(B_1)$, I would prefer, in particular in view of later developments, to
call (D) the large sieve and rather consider results of type (A), (B),
(C) and others as applications of the theory of the large sieve.

Following Linnik it was Renyi's (ID, El) merit to generalize the 
large sieve method in several respects. Simultaneously he noticed the 
fundamental importance of (D), and also treated the more precise ver- 
sion (C) for the first time. 

Generalizing (0.2), for arbitrary complex numbers $a_n$ we set

$$S(q, \ell) := \sum_{\substack{M < n \le M+N \\ n \equiv \ell \bmod q}} a_n \tag{0.28}$$

and

$$S := S(1, 1) = \sum_{M < n \le M+N} a_n. \tag{0.29}$$

(By taking for $a_n$ the characteristic function of $\mathcal{Y}$ in
(0.28) we get (0.2).)

Let $\mathcal{P}$ be an arbitrary set of primes $p \le Q$. Renyi's
paper [2] has implicitly the following (explicit) generalization of (D)
(with $Q \le \sqrt{N}$):



0i) 



pef 1=1 M<n<M+N (0.30) 

+ ^| P | ke< !E, 






where a' s are > 0, m = max a n , q is a squarefree number not divis- 

M<n<M+N 

ible by any pep, and < e < — . 

The use of a set $\mathcal{P}$ of primes here is particularly suitable
for applications of the type (A), (B) and (C), because $\mathcal{P}$ can
serve as the set of exceptional primes, and the factor $|\mathcal{P}|$
instead of Q improves the estimate. From this generalization to
composite moduli and arbitrary coefficients he derived ([2], (Lemma 1))
a (C)-type result about

\s( Pq ,e)- s J^-\<-?—. (o.3i) 
p pqc(pq) 

From here he succeeded ([1], (Lemma 2)) in making the large sieve
applicable also in the estimation of certain averages of character sums,
i.e., sums of the form

$$\sum_{M < n \le M+N} \chi(n) a_n. \tag{0.32}$$

!1 for n e y, . 
i,e. to d0. 17l >. 
for n <£ y, 

then Renyi's C-result corresponding to dO.l^t is 

(Ci) |p| < -^J^l for 2 < VyV. (0.33) 

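By using a different method of proof, Renyi ([7], (18)) next proved

$$(D_1)\qquad \sum_{p \le (N/12)^{1/3}} p D(p) \le 2NS \tag{0.34}$$

and then applied this to (C) and (A) obtaining ([7], (Theorem 3))

$$(C_2)\qquad |\mathcal{P}| \le \frac{2Nc^2}{\delta S} \quad \text{for } Q \le \Big(\frac{N}{12}\Big)^{1/3} \tag{0.35}$$

and (Q, (Corollary 1))

$$(A_2)\qquad |\mathcal{P}| \le \frac{2N}{\delta S} \quad \text{for } Q \le \Big(\frac{N}{12}\Big)^{1/3}. \tag{0.36}$$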
Finally Renyi improved his method and considered the large sieve in
its (D)-version as a special statistical statement ([4], [8], [10],
[12]) and he showed ([4])

$$(D_2)\qquad \sum_{p \le \frac{1}{2} N^{1/3}} p D(p) \le 9NS \tag{0.37}$$

and from this ([4], (Theorem 3))

$$(C_3)\qquad |\mathcal{P}| \le \frac{9Nc^2}{\delta S} \quad \text{for } Q \le \frac{1}{2} N^{1/3}. \tag{0.38}$$

The last results above, though stronger than Linnik's, are still valid
only for a smaller range for the primes, i.e., for smaller values of Q.
In [9] (cf. Halberstam and Roth [1] (Ch. IV, Theorem 6')) he prepared
the ground for the extension

$$(D_3)\qquad \sum_{p \le Q} p D(p) \le (N + Q^3) S \quad \text{for } Q \le \sqrt{N}. \tag{0.39}$$

Barban has been the first to prove a (D)-result by using Linnik's 
original method. He showed ([8], (Theorem 1)) 

NO K 2 n , N 
(Z> 4 ) > , PD(P) < 2n-^S + -f S 2 for 1< K < rmn(Q, -), 

K ° Q Q 

(0.40) 

and for 0{) in the case q = 1 (| 10], (1.3); for q > 1 cf. El, (Theorem 
3.1)) 

:^pi:U\s(p,p>- 1 p \ 2 < i Z «n+ 

(D 2 ) „ ' m<,<m + w (0.41) 



l+ 4 ™ 5 IPl for <2<N, 



subject to the conditions < e < min( - , — -), d« > 0. The partic- 
J 2nN 2Q 2 

(1 for n € y, 
(i.e., 5 = 5) of this result contains (D4) 
for « £ y, 

and from it Barban ([10], (Theorem 1.1)) derived for (A):

$$(A_3)\qquad |\mathcal{P}| \le 20 \frac{N}{\delta S} \quad \text{for } Q = \sqrt{N}. \tag{0.42}$$

The importance of $(D_4)$ lies in the fact that there is no longer any
restriction on Q, apart from the natural one, namely $Q \le N$, implied
by the $K_0$-condition.

There is another generalization to a weighted form of (D) due to 
Halberstam and Roth [ 1 ] (Ch. IV, Theorem 5). They proved that for 
arbitrary weights 5 P , satisfying 

0<6 P<2jQ ( °- 43) 

one has 

(D*) J] pS p D(p) « S + N 4 2 4- t - 44 ) 

P<Q p<Q 

A very important progress was made by Roth [2], who succeeded in
proving

$$(D_5)\qquad \sum_{p \in \mathcal{P}} p D(p) \ll (N + Q^2 \log K_0) S + S^2 |\mathcal{P}| K_0^{-2} \quad \text{for } K_0 \ge 2, \tag{0.45}$$

where again $\mathcal{P}$ denotes an arbitrary set of primes $p \le Q$,
and there is no restriction on Q. This result includes in particular
([2], (9))

$$(D_6)\qquad \sum_{p \le \sqrt{N/\log N}} p D(p) \ll NS, \tag{0.46}$$

as well as ([2], (Corollary 1))

$$(D_7)\qquad \sum_{p \le Q} p D(p) \ll S Q^2 \log Q \quad \text{for } Q \ge \sqrt{\frac{N}{\log N}}. \tag{0.47}$$

With respect to (C) Roth ([2], (Corollary 2)) derived from $(D_5)$,
also noticing that it is more appropriate (as can be seen from (0.18))
to introduce

$$\rho := \frac{1}{4} \min_{p \le Q} \frac{\omega(p)}{p\, c^2(p)} \tag{0.48}$$

instead of $\delta$ and c separately,

$$(C_4)\qquad |\mathcal{P}| \ll \frac{N + Q^2 \log \frac{1}{\rho}}{\rho S} \quad \text{for } Q \le N. \tag{0.49}$$

It is not too simple a matter to make a comparison of the various
basic (D)-results of the large sieve. However, the main features are
the following ones. The very effective estimate (for K(N, Q, S) of
(0.6))

$$\ll NS \tag{0.50}$$

which was known by Renyi for Q up to $N^{1/3}$ (cf. (0.34), (0.37),
(0.39)) has been extended by Roth (cf. (0.46)) up to
$Q = \sqrt{N/\log N}$. For values of Q beyond $\sqrt{N}$, (0.30) and
(0.37) still yield non-trivial estimates (compare (0.24)) up to the
vicinity of N. Apart from the factor log Q and the $\ll$-constant,
(0.47) is in most cases the better estimate. Only for
$Q \asymp (N^2 S)^{1/5}$ does (0.30) yield

$$\ll Q^2 S \tag{0.51}$$

which is (0.47) without the factor log Q. On the other hand, the same
result is implied by (0.45) if moreover $S \ll Q \log Q$.
As far as the methods of proof for the large sieve are concerned there
are different ways of approach.

Recalling our notation introduced in the beginning of this chapter
(i.e., $a_n = 1$ for $n \in \mathcal{Y}$ and $a_n = 0$ for
$n \notin \mathcal{Y}$ only) the first method, used in the
aforementioned basic papers of Linnik [1] and of Renyi [2], is based on
a treatment of the exponential sum

$$T(x) := \sum_{n \in \mathcal{Y}} e(nx), \quad x \in \mathbb{R}, \qquad e(u) := e^{2\pi i u}. \tag{0.52}$$

Here the essential use is made of the Farey dissection from the method
of Hardy and Littlewood and of Parseval's formula. In fact, the close
connection with T(x) stems from the identity (cf. (7.1))

$$\sum_{\ell=1}^{p-1} \Big| T\Big(\frac{\ell}{p}\Big) \Big|^2 = p \sum_{\ell=1}^{p} \Big( S(p, \ell) - \frac{S}{p} \Big)^2 = p D(p), \tag{0.53}$$
the second equality being (0.5). For the proof of the first equality
we need only note that the left-hand side expression

$$= -S^2 + \sum_{\ell=1}^{p} \Big| T\Big(\frac{\ell}{p}\Big) \Big|^2 = -S^2 + \sum_{n_1 \in \mathcal{Y}} \sum_{n_2 \in \mathcal{Y}} \sum_{\ell=1}^{p} e\Big( \frac{(n_1 - n_2)\ell}{p} \Big)$$

and further, by (1.22), the inner sum in the last expression = p or
= 0 according as $(n_1 - n_2)$ is $\equiv$ or $\not\equiv 0 \bmod p$,
so that our expression

$$= -S^2 + p \sum_{\ell=1}^{p} S^2(p, \ell) = p \sum_{\ell=1}^{p} \Big( S(p, \ell) - \frac{S}{p} \Big)^2$$

by (0.3). Hence the form (D), which we have considered so far to be the
basis of the large sieve, amounts to asking for an upper bound for

$$\sum_{p \le Q} \sum_{\ell=1}^{p-1} \Big| T\Big(\frac{\ell}{p}\Big) \Big|^2. \tag{0.54}$$
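The identity (0.53) linking the exponential sum to the variance p D(p) is easy to confirm numerically. The sketch below evaluates both sides in floating point; the function names are illustrative, and agreement is only up to rounding error.

```python
import cmath


def e(u: float) -> complex:
    """e(u) = exp(2*pi*i*u), as in (0.52)."""
    return cmath.exp(2j * cmath.pi * u)


def T(ys, x: float) -> complex:
    """Exponential sum T(x) = sum over n in ys of e(n x)."""
    return sum(e(n * x) for n in ys)


def exponential_side(ys, p: int) -> float:
    """Left-hand side of (0.53): sum over l = 1, ..., p-1 of |T(l/p)|^2."""
    return sum(abs(T(ys, l / p)) ** 2 for l in range(1, p))


def variance_side(ys, p: int) -> float:
    """Right-hand side of (0.53): p * sum over l of (S(p, l) - S/p)^2."""
    S = len(ys)
    counts = [0] * p
    for n in ys:
        counts[n % p] += 1
    return p * sum((c - S / p) ** 2 for c in counts)


squares = [k * k for k in range(1, 11)]
small_primes = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47]
```

For the squares and p = 3 both sides equal 74, since $T(1/3) = 3 + 7e(1/3)$ has squared modulus 37, and likewise $T(2/3)$.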



Another method of proof that should be mentioned here takes a more
general point of view and may serve to simplify the understanding of
the large sieve method. It relates the problems with certain results in
an inner product space. This has been already developed by Renyi in his
early papers ([4], [7]), where he refers to Boas [1] and also to Bellman
[1] for their extensions of classical results to 'quasi-orthogonal'
functions. Roth's work [2] has used a kind of combination of both
methods.

Further methods and details of proofs with respect to the theory of
the large sieve will not be given here, but rather will be mentioned in
appropriate chapters, in particular under Chapter 2. However, since we
will not have the opportunity to use the second method mentioned above,
we shall present here a basic result due to A. Selberg (cf. Bombieri
[4]). The proof is very elegant and the result seems to me to be most
suitable for giving an idea of this method, which consists then in
choosing appropriate functions in an application of (0.55).
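Theorem 0.1. Let $f, \varphi_1, \ldots, \varphi_R$ be elements of an
inner product space over $\mathbb{C}$. Then

$$\sum_{r=1}^{R} \frac{|(f, \varphi_r)|^2}{\sum_{s=1}^{R} |(\varphi_r, \varphi_s)|} \le \|f\|^2. \tag{0.55}$$

Proof. For any complex numbers $c_r$, $1 \le r \le R$, we have by the
Bessel's inequality argument,

$$\|f\|^2 - 2 \operatorname{Re} \sum_{r=1}^{R} \bar{c}_r (f, \varphi_r) + \sum_{r,s=1}^{R} c_r \bar{c}_s (\varphi_r, \varphi_s) = \Big\| f - \sum_{r=1}^{R} c_r \varphi_r \Big\|^2 \ge 0. \tag{0.56}$$

Using here

$$\sum_{r,s=1}^{R} c_r \bar{c}_s (\varphi_r, \varphi_s) \le \sum_{r,s=1}^{R} \Big( \frac{1}{2} |c_r|^2 + \frac{1}{2} |c_s|^2 \Big) |(\varphi_r, \varphi_s)| = \sum_{r=1}^{R} |c_r|^2 \sum_{s=1}^{R} |(\varphi_r, \varphi_s)| \tag{0.57}$$

and then choosing

$$c_r = \frac{(f, \varphi_r)}{\sum_{s=1}^{R} |(\varphi_r, \varphi_s)|} \tag{0.58}$$

the proof is completed.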
From Theorem 0.1 or variants of it the main tools for this (second)
method can be derived (cf. Montgomery [5] (pp. 4-8) and Huxley [7]
(pp. 29-30)). A simple example is that as an immediate consequence of
(0.55) we obtain, under the same assumptions, 'Bellman's inequality'
(Bombieri 0)

$$\sum_{r=1}^{R} |(f, \varphi_r)|^2 \le \|f\|^2 \max_{1 \le r \le R} \sum_{s=1}^{R} |(\varphi_r, \varphi_s)|. \tag{0.59}$$

(0.55) as well as (0.59) generalize Bessel's inequality, or (for R = 1)
Schwarz's inequality, in an inner product space, to which they reduce
when $\{\varphi_r\}$ happen to be orthonormal.
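Both inequalities can be tried out on concrete vectors. The sketch below works in $\mathbb{C}^n$ with the standard inner product (linear in the first argument, as in (0.61)); the dimensions, the number of vectors and the random seed are arbitrary choices for illustration.

```python
import random


def inner(f, g) -> complex:
    """Standard inner product on C^n, linear in the first argument."""
    return sum(a * b.conjugate() for a, b in zip(f, g))


def row_sum(phi_r, phis) -> float:
    """sum over s of |(phi_r, phi_s)|."""
    return sum(abs(inner(phi_r, phi_s)) for phi_s in phis)


def selberg_lhs(f, phis) -> float:
    """Left-hand side of (0.55); assumes every phi_r is non-zero."""
    return sum(abs(inner(f, phi_r)) ** 2 / row_sum(phi_r, phis) for phi_r in phis)


def bellman_lhs(f, phis) -> float:
    """Left-hand side of (0.59)."""
    return sum(abs(inner(f, phi_r)) ** 2 for phi_r in phis)


def bellman_rhs(f, phis) -> float:
    """Right-hand side of (0.59)."""
    return inner(f, f).real * max(row_sum(phi_r, phis) for phi_r in phis)


def random_vector(rng, n: int) -> list:
    """A vector in C^n with independent Gaussian real and imaginary parts."""
    return [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]


rng = random.Random(1)  # fixed seed for reproducibility
f = random_vector(rng, 4)
phis = [random_vector(rng, 4) for _ in range(6)]  # six vectors in C^4, far from orthogonal
```

For an orthonormal family the quantity `selberg_lhs` reduces to the Bessel sum: with `phis = [[1, 0, 0], [0, 1, 0]]` and `f = [1, 2, 2]` it gives 1 + 4 = 5, against $\|f\|^2 = 9$.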
The importance of this method for the modern development of the
large sieve has been noticed by Bombieri, Gallagher and A. Selberg (cf.
Bombieri 0, @).

The further development in the theory of the large sieve has shown
that with the work of Roth one had already come close to best possible
results. The next decisive step in this direction was made in an
important paper by Bombieri [1] (also independently in a paper by A.I.
Vinogradov [1]). Apart from the important deductions he made from his
result and other details (not to be mentioned here), the main features
of this progress in theoretical respect were (i) the extension of his
((D)-type) result from an estimate of (0.54) to an estimate (of the
larger sum) where the summation is extended over all natural numbers
$q \le Q$ instead of over only primes $p \le Q$ (cf. (2.3)), (ii)
keeping thereby not only the quality of Roth's result but even removing
a log-factor, and (iii) obtaining an explicit $\ll$-constant, which is
of considerable importance in certain applications.

After having recalled some arithmetical results in Chapter 1 we shall
take up this modern version of the large sieve in Chapter 2.
Applications to character sums are possible, as has been already
mentioned in (0.32), and will be treated in Chapter 3. Important further
applications of the large sieve, not mentioned so far, to Dirichlet
series which were first noticed by Davenport (cf. Montgomery [2]) are
dealt with in Chapter 4. This theme is continued in Chapter 5 where
certain 'hybrid' forms of the large sieve for applications to Dirichlet
series also occur. Chapter 6 is devoted to a survey on special
applications of the large sieve to Dirichlet series and also to some
problems of number theory. In Chapter 7 we shall turn to the large sieve
in its arithmetical form (B). Lastly, in Chapter 8 we give an
application of the large sieve to a special problem of number theory,
viz. the Brun-Titchmarsh theorem.

So far very little has been said about the applications of the large
sieve. As can be seen from the description of the contents of the
following chapters given above there is a great variety of applications.
Many of these are of a 'statistical' nature, in the sense that they are
concerned with certain averages. Many results of number theory are known
to be consequences of certain unproved hypotheses. However, in many
cases we are able to apply the aforementioned statistical statements to
obtain, strikingly, the same results as those which one gets by assuming
certain still unproved hypotheses.

Apart from a first application in the construction of a non-basic
essential component (Linnik [2]), Linnik [3] showed the power of his
new method, via (A), in a result about the least quadratic non-residue
mod p. Renyi's first application of the large sieve yielded a surprising
result in the direction of Goldbach's conjecture. In fact, Renyi has
been the first to prove that every sufficiently large even number can be
written as a sum of a prime and a number consisting of a bounded number
of prime factors.

We shall not mention other applications in this historical
introduction but rather defer them to later (appropriate) chapters.
However, in keeping with the title of this chapter, following the Notes
for this chapter we add a list of references to works, up to the first
paper of Bombieri, in chronological order. This list also includes
papers, up to this point, which deal only with the applications of the
large sieve.

There are also generalizations of the large sieve in various direc- 
tions. Some of these papers are given in a second list of references 
following the above one. 



NOTES 

In order not to make this purely historical introduction too long, we 
have selected only some significant results that seem to be illuminat- 
ing for our way of presenting the subject. For further information the 
reader is referred to the surveys in Barban [10], Halberstam and Roth 
[ 1 ]. Davenport [ 1 1, Roth [4] Montgomery [5]. Huxley [7] and Bombieri 

13, 0. 

(0.25), (0.26): By using $|e^{ix} - 1| \le |x|$ instead of Linnik's
estimate $|e^{ix} - 1| \le e|x|$ the constant 20 can be replaced by 4.

( 10.331 ): Renyi's paper ([2], (Lemma 1)) contains only the special 
case oj(p) - p % ^ ', c - p 1 ^ 9 . However, as he has pointed out ([7], (Theo- 
rem 2)) his method also applies to the general case. 
(0.39): Note that by $(D_0)$, (0.39) holds also for $Q > \sqrt{N}$.
(0.40): His method gives actually a factor - for the second term,

and also the condition on Kq may be slightly relaxed. 

(0.39), (0.40): In general $(D_3)$ yields the better result for Q in
the vicinity of $N^{1/3}$ whereas $(D_4)$ becomes superior when Q tends
to $\sqrt{N}$. Also $(D_4)$ has been the first result to yield
non-trivial estimates up to the vicinity of N.

(0.42): Actually, the more appropriate choice of
$\varepsilon = \frac{1}{2\pi N}$ yields the factor $3\pi$ instead of 20,
and a simple refinement of the proof gives even $\frac{3}{2}\pi$. A
consequence of this remark is, when applied to (C),

$$(C_5)\qquad |\mathcal{P}| \le \frac{3}{2} \pi \frac{N c^2}{\delta S} \quad \text{for } Q = \sqrt{N}, \tag{0.60}$$

a result which is always better than Renyi's ($(C_1)$).

(0.44): Following Barban's method of proof for $(D_4)$ one notices
that in $(D^*)$ the factor $N^4$ can be replaced by $N^2 S^2$.

(0.56): Here we have used that

$$\|f\|^2 = (f, f), \quad (f, g + h) = (f, g) + (f, h), \quad (cf, g) = c(f, g), \quad (g, f) = \overline{(f, g)} \tag{0.61}$$

and (hence also)

$$(f, cg) = \bar{c}(f, g). \tag{0.62}$$

For an application of Theorem 0.1 see the notes following Chapter 2.

History of the large sieve. References in 
chronological order. 

Linnik O, Boas (D, Linnik 0, 0, Bellman [1], Renyi O, 0, 
0, El, 0, 0, 0, 0, Bateman, Chowla and Erdos 0, Kubiliyus 
0, Renyi [9], Wang 0, Renyi (TO), EL Stepanov 0, Hua 0, Renyi 17 
fill. Barban 0, Linnik 0, Barban 0, Erdos 0, Barban 0, Gel- 
fond and Linnik 0, Pan 0, Erdos 0, Pan 0, Wang 0, Levin 0, 
Barban 0, 0, 0, Pan 0, Rieger 0, Pan 0, Barban 0, Roth 0, 
Levin [2|, Barban 0, 0, M. and S. Uchiyama 0, Barban (TO), Wang, 
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Hsieh and Yu Q, Halberstam and Roth Q, Roth 0, Buchstab [ 1 1, A.I. 
Vinogradov [ 1 ], Bombieri [ 1 ], 

Extensions of the large sieve 

Andruhaev fl], Fogels fl], Goldfeld 0, Hlawka Q, Huxley 
CD, 0, El, Johnsen fl], Rieger fl], Q, Samandarov ffl, Schaal QJ, 
Wilson P. 



Chapter 1 

Arithmetical Aids 



IN THIS chapter we shall collect, for the reader's convenience, some of
the results from elementary number theory which are used in later
chapters.



1 Multiplicative functions 

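By

$$\mathfrak{M} \tag{1.1}$$

we denote the set of functions $f$ (defined on $\mathbb{N}$) with

$$f \not\equiv 0 \tag{1.2}$$

that satisfy

$$f(nq) = f(n) f(q) \quad \forall n, q \in \mathbb{N} \tag{1.3}$$

whenever

$$(n, q) = 1. \tag{1.4}$$

Since we have excluded the trivial $f$ by (1.2), there is a $q$ such that
$f(q) \ne 0$, and so (1.3) implies for $n = 1$

$$f \in \mathfrak{M} \Rightarrow f(1) = 1. \tag{1.5}$$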






Obviously

$$f(n) = n^z \in \mathfrak{M} \quad \text{for every } z \in \mathbb{C}, \tag{1.6}$$

so, in particular, for $z = 0$

$$f(n) = 1 \in \mathfrak{M}, \tag{1.7}$$

and for these functions the restriction (1.4) is not even necessary. A
non-trivial example is provided by the Möbius function, defined by

$$\mu(n) = \begin{cases} 0 & \text{if } n \text{ is not squarefree, i.e., there is a prime } p \text{ with } p^2 \mid n, \\ (-1)^{\nu(n)} & \text{if } n \text{ is squarefree,} \end{cases} \tag{1.8}$$

where, as usual,

$$\nu(n) = \sum_{p \mid n} 1 \quad (\nu(1) = 0). \tag{1.9}$$

For $\mu$, (1.3) is easily checked, because, if at least one of the numbers
$n$ and $q$ is not squarefree we have zero on both sides, and if both $n$ and
$q$ are squarefree we get (1.3) subject to (1.4) by using

$$\nu(nq) = \nu(n) + \nu(q) \quad \text{for } (n, q) = 1; \tag{1.10}$$

hence

$$\mu \in \mathfrak{M}. \tag{1.11}$$

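A simple way of obtaining new functions is by multiplying together
(1.3) for any two such functions, so that

$$f_1, f_2 \in \mathfrak{M} \Rightarrow f_1 f_2 \in \mathfrak{M}. \tag{1.12}$$

A less trivial result is

$$f_1, f_2 \in \mathfrak{M} \Rightarrow g(n) := \sum_{d \mid n} f_1(d) f_2\!\left(\frac{n}{d}\right) \in \mathfrak{M}. \tag{1.13}$$

For a proof we first note that if $d \mid nq$ there is, in view of (1.4), a
unique factorization $d = th$ with $t \mid n$, $h \mid q$. Therefore, by considering
(1.3) for both functions, under the condition (1.4), we obtain

$$g(nq) = \sum_{t \mid n} \sum_{h \mid q} f_1(th) f_2\!\left(\frac{nq}{th}\right) = \sum_{t \mid n} f_1(t) f_2\!\left(\frac{n}{t}\right) \sum_{h \mid q} f_1(h) f_2\!\left(\frac{q}{h}\right) = g(n) g(q). \tag{1.14}$$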



For an $f \in \mathfrak{M}$ it follows by repeated application of (1.3), subject to
(1.4), that

$$f \in \mathfrak{M} \Rightarrow f(n) = \prod_{p \mid n} f(p^{a_p}) \quad \text{where } n = \prod_{p \mid n} p^{a_p}, \tag{1.15}$$

so that these functions need only be known at prime-powers. In particular,
for squarefree numbers $q$ one has

$$f \in \mathfrak{M} \Rightarrow f(q) = \prod_{p \mid q} f(p) \quad \text{if } \mu(q) \ne 0. \tag{1.16}$$



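A good illustration of this property of functions in $\mathfrak{M}$ is given as
follows. Suppose that $f_1, f_2 \in \mathfrak{M}$, and let $q$ be a squarefree number.
Then $g \in \mathfrak{M}$, where $g$ is defined by (1.13), and from (1.16) we get

$$g(q) = \prod_{p \mid q} g(p) = \prod_{p \mid q} \Big( \sum_{d \mid p} f_1(d) f_2\!\left(\frac{p}{d}\right) \Big) = \prod_{p \mid q} \big( f_1(p) f_2(1) + f_1(1) f_2(p) \big) \quad \text{for } \mu(q) \ne 0. \tag{1.17}$$

Therefore, in view of (1.5), we have proved that

$$f_1, f_2 \in \mathfrak{M} \Rightarrow \sum_{d \mid q} f_1(d) f_2\!\left(\frac{q}{d}\right) = \prod_{p \mid q} \big( f_1(p) + f_2(p) \big), \quad \text{if } \mu(q) \ne 0. \tag{1.18}$$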



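For an application we note that by (1.11) and (1.7) one can take $\mu$
and 1 for $f_1$ and $f_2$ respectively. Then, denoting by $q(n)$ the 'kernel' of
$n$, i.e., the largest squarefree divisor of $n$, (1.18) yields

$$\sum_{d \mid n} \mu(d) = \sum_{d \mid q(n)} \mu(d) = \prod_{p \mid q(n)} \big( \mu(p) + 1 \big), \tag{1.19}$$

which gives the well-known formula

$$\sum_{d \mid n} \mu(d) = \begin{cases} 1 & \text{for } n = 1, \\ 0 & \text{for } n > 1. \end{cases} \tag{1.20}$$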
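
As a small numerical illustration of (1.8) and (1.20), the following Python sketch computes the Möbius function by trial division and sums it over the divisors of n. The function names are hypothetical and chosen only for this example; nothing here is taken from the lectures themselves.

```python
def mobius(n):
    """Moebius function mu(n) as in (1.8), computed by trial division."""
    if n == 1:
        return 1
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                # p^2 divides the original n, so n is not squarefree
                return 0
            result = -result
        p += 1
    if n > 1:
        # one remaining prime factor
        result = -result
    return result


def mobius_divisor_sum(n):
    """Left-hand side of (1.20): the sum of mu(d) over all divisors d of n."""
    return sum(mobius(d) for d in range(1, n + 1) if n % d == 0)
```

Under these assumptions `mobius_divisor_sum(n)` should return 1 for n = 1 and 0 for every n > 1, which is the content of (1.20).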
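2 Ramanujan's function

Recalling our notation (cf. (0.52))

$$e(u) := e^{2\pi i u} \tag{1.21}$$

we see, on considering the partial sums of geometric series, that

$$\sum_{\ell=1}^{q} e\!\left(n\frac{\ell}{q}\right) = \begin{cases} q & \text{for } q \mid n, \\ 0 & \text{for } q \nmid n, \end{cases} \quad \forall q \in \mathbb{N},\ n \in \mathbb{Z}. \tag{1.22}$$

The continuous analogue of (1.22) is

$$\int_0^1 e(nx)\,dx = \begin{cases} 1 & \text{for } n = 0, \\ 0 & \text{for } n \ne 0, \end{cases} \quad n \in \mathbb{Z}. \tag{1.23}$$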



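Since $e(n\ell/q)$, as a function of $\ell$, has period $q$, (1.22) may also be written as

$$\sum_{\ell \bmod q} e\!\left(n\frac{\ell}{q}\right) = \begin{cases} q & \text{for } q \mid n, \\ 0 & \text{for } q \nmid n, \end{cases} \quad \forall q \in \mathbb{N},\ n \in \mathbb{Z}, \tag{1.24}$$

where $\ell$ runs through a complete system of residues modulo $q$. The
corresponding result for (1.23) is

$$\int_{\alpha}^{\alpha+1} e(nx)\,dx = \begin{cases} 1 & \text{for } n = 0, \\ 0 & \text{for } n \ne 0, \end{cases} \quad \forall n \in \mathbb{Z},\ \alpha \in \mathbb{R}. \tag{1.25}$$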



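For any $a_\ell$'s $\in \mathbb{C}$,

$$\sum_{n=1}^{q} \Big| \sum_{\ell=1}^{q} a_\ell\, e\!\left(n\frac{\ell}{q}\right) \Big|^2 = \sum_{\ell_1, \ell_2 = 1}^{q} a_{\ell_1} \bar{a}_{\ell_2} \sum_{n=1}^{q} e\!\left((\ell_1 - \ell_2)\frac{n}{q}\right), \tag{1.26}$$

and because of $|\ell_1 - \ell_2| < q$ in the innermost sum, (1.22) gives

$$\frac{1}{q} \sum_{n=1}^{q} \Big| \sum_{\ell=1}^{q} a_\ell\, e\!\left(n\frac{\ell}{q}\right) \Big|^2 = \sum_{\ell=1}^{q} |a_\ell|^2, \quad \forall a_\ell \in \mathbb{C}. \tag{1.27}$$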



Similarly, by (1.23), assuming that $\sum_n |a_n|^2 < \infty$ (so, in particular,
for any finite range for $n$), we have

$$\int_0^1 \Big| \sum_n a_n e(nx) \Big|^2 dx = \sum_{n_1, n_2} a_{n_1} \bar{a}_{n_2} \int_0^1 e\big((n_1 - n_2)x\big)\,dx = \sum_n |a_n|^2. \tag{1.28}$$

In view of (1.25), it is obvious that (1.28) remains true if the integral
is extended over any interval of length 1.
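Ramanujan's sum is defined by

$$c_q(n) := {\sum_{\ell=1}^{q}}' e\!\left(n\frac{\ell}{q}\right) = {\sum_{\ell \bmod q}}' e\!\left(n\frac{\ell}{q}\right), \quad \forall q \in \mathbb{N},\ n \in \mathbb{Z}, \tag{1.29}$$

where

$${\sum_{\ell=1}^{q}}' := \sum_{\substack{\ell=1 \\ (\ell, q) = 1}}^{q}$$

and ${\sum}'_{\ell \bmod q}$ means summation as $\ell$ runs through a reduced system of
residues modulo $q$. In order to compute $c_q(n)$ we use (1.20). Noting
that always

$$\sum_{d \mid q} f(d) = \sum_{d \mid q} f\!\left(\frac{q}{d}\right) \tag{1.30}$$

and writing $\ell = \ell_1 \cdot \frac{q}{d}$ in the last step below we find, by (1.20),

$$c_q(n) = \sum_{\ell=1}^{q} e\!\left(n\frac{\ell}{q}\right) \sum_{d \mid (\ell, q)} \mu(d) = \sum_{d \mid q} \mu(d) \sum_{\substack{\ell=1 \\ \ell \equiv 0 \bmod d}}^{q} e\!\left(n\frac{\ell}{q}\right) = \sum_{d \mid q} \mu\!\left(\frac{q}{d}\right) \sum_{\substack{\ell=1 \\ \ell \equiv 0 \bmod q/d}}^{q} e\!\left(n\frac{\ell}{q}\right) = \sum_{d \mid q} \mu\!\left(\frac{q}{d}\right) \sum_{\ell_1=1}^{d} e\!\left(n\frac{\ell_1}{d}\right), \tag{1.31}$$

so that, by (1.22), we have

$$c_q(n) = \sum_{\substack{d \mid q \\ d \mid n}} \mu\!\left(\frac{q}{d}\right) d, \quad \forall q \in \mathbb{N},\ n \in \mathbb{Z}. \tag{1.32}$$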
We note the following two special cases. For $n = 0$, we see from
(1.29) that Ramanujan's sum becomes Euler's function $\varphi(q)$, and (1.32)
leads via (1.30) to the well-known formulae

$$\varphi(q) := {\sum_{\ell=1}^{q}}' 1 = c_q(0) = \sum_{d \mid q} \mu\!\left(\frac{q}{d}\right) d = q \sum_{d \mid q} \frac{\mu(d)}{d} = q \prod_{p \mid q(q)} \Big( -\frac{1}{p} + 1 \Big) = q \prod_{p \mid q} \Big( 1 - \frac{1}{p} \Big), \tag{1.33}$$

where we have used also (1.11), (1.12), (1.6) for $z = 0$ and
$z = -1$, and (1.18). Next, for $(n, q) = 1$, the right-hand side of (1.32)
reduces to $\mu(q)$, so that via (1.29) we also obtain

$$\mu(q) = c_q(n) = {\sum_{\ell=1}^{q}}' e\!\left(n\frac{\ell}{q}\right) \quad \text{if } (n, q) = 1,\ q \in \mathbb{N},\ n \in \mathbb{Z}. \tag{1.34}$$
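
The two expressions for Ramanujan's sum, the definition (1.29) and the divisor formula (1.32), can be compared numerically. The sketch below is only an illustration; the helper names (`e`, `mobius`, `ramanujan_sum_direct`, `ramanujan_sum_formula`) are assumptions made for this example.

```python
import cmath
from math import gcd


def e(u):
    """e(u) = exp(2*pi*i*u), the notation of (1.21)."""
    return cmath.exp(2j * cmath.pi * u)


def mobius(n):
    """Moebius function mu(n) by trial division."""
    if n == 1:
        return 1
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    if n > 1:
        result = -result
    return result


def ramanujan_sum_direct(q, n):
    """c_q(n) from the definition (1.29): sum over reduced residues l mod q."""
    return sum(e(n * l / q) for l in range(1, q + 1) if gcd(l, q) == 1)


def ramanujan_sum_formula(q, n):
    """c_q(n) from (1.32): sum of d * mu(q/d) over common divisors d of q and n."""
    return sum(d * mobius(q // d)
               for d in range(1, q + 1)
               if q % d == 0 and n % d == 0)
```

For n = 0 every divisor d of q qualifies, so `ramanujan_sum_formula(q, 0)` reproduces Euler's function as in (1.33); for (n, q) = 1 only d = 1 survives and the value is mu(q), as in (1.34).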



3 Dirichlet's characters and Gaussian Sums 

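For each $q \in \mathbb{N}$ we define the arithmetic functions, named 'characters
modulo $q$',

$$\chi(m)\ (\in \mathbb{C}), \quad \forall m \in \mathbb{Z}. \tag{1.35}$$

For an elementary introduction of these functions one requires the
following four properties (1.36) through (1.39):

$$\chi(1) = 1, \tag{1.36}$$

$$\chi(mn) = \chi(m)\chi(n) \quad \forall m, n \in \mathbb{Z}, \tag{1.37}$$

i.e., $\chi \in \mathfrak{M}$ without the restriction $(m, n) = 1$,

$$\chi(n) = \chi(\ell) \quad \text{for } n \equiv \ell \bmod q \tag{1.38}$$

and

$$\chi(n) = 0 \quad \text{for } (n, q) > 1. \tag{1.39}$$

The relations (1.38) and (1.36) imply

$$\chi(m) = 1 \quad \text{for } m \equiv 1 \bmod q. \tag{1.40}$$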



By (1.39) any character vanishes for all $n$ which are not coprime
to $q$. On the other hand, for $(n, q) = 1$, we have by Euler's theorem
$n^{\varphi(q)} \equiv 1 \bmod q$, so that (1.37) and (1.40) give

$$(\chi(n))^{\varphi(q)} = 1 \quad \text{for } (n, q) = 1, \tag{1.41}$$

i.e., for $(n, q) = 1$, $\chi(n)$ is a $\varphi(q)$-th root of unity; in particular,

$$|\chi(n)| = 1 \quad \text{for } (n, q) = 1. \tag{1.42}$$



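It is obvious that the function

$$\chi_0(n) = \begin{cases} 1 & \text{if } (n, q) = 1, \\ 0 & \text{if } (n, q) > 1, \end{cases} \tag{1.43}$$

is a character modulo $q$; it is called the principal character mod $q$. A
simple formula is

$$\sum_{\ell=1}^{q} \chi(\ell) = {\sum_{\ell=1}^{q}}' \chi(\ell) = \begin{cases} \varphi(q) & \text{for } \chi = \chi_0, \\ 0 & \text{for } \chi \ne \chi_0. \end{cases} \tag{1.44}$$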



The first statement is immediate from (1.39), and for $\chi = \chi_0$ the
second statement follows by (1.43). Next, for $\chi \ne \chi_0$ there must be a
number $m$ such that

$$(m, q) = 1, \quad \chi(m) \ne 1. \tag{1.45}$$

Now when $\ell$ runs through a reduced residue system mod $q$, then
so does $m\ell$. Hence, in view of (1.38) and (1.37), we have

$$\chi(m) {\sum_{\ell \bmod q}}' \chi(\ell) = {\sum_{\ell \bmod q}}' \chi(\ell m) = {\sum_{\ell \bmod q}}' \chi(\ell), \tag{1.46}$$

which implies the second statement of (1.44) for $\chi \ne \chi_0$ because of
$\chi(m) \ne 1$.

It can be shown that for any given $q \in \mathbb{N}$ there are exactly $\varphi(q)$
distinct functions fulfilling (1.36) through (1.39), i.e., there are $\varphi(q)$
characters mod $q$, a fact that can be stated in the form

$$\sum_{\chi \bmod q} 1 = \varphi(q). \tag{1.47}$$
It is easily checked that with $\chi(n)$ also $\bar\chi(n)$ is a character mod $q$.
Also, with $\chi_1, \chi_2$ the function $\chi_1 \chi_2$ is a character mod $q$. The second
remark leads us to the following statement. If for certain characters
$\chi', \chi'', \chi_1 \bmod q$, $\chi'(n)\chi_1(n) = \chi''(n)\chi_1(n)$ holds for all $n$, then in view
of (1.42) and (1.39) one has $\chi' = \chi''$. Hence, if $\chi_1$ is a fixed character
mod $q$ and $\chi$ runs through all characters mod $q$, then $\chi_1 \chi$ also runs
through all characters mod $q$, a fact which can be expressed through

$$\chi_1(m) \sum_{\chi \bmod q} \chi(m) = \sum_{\chi \bmod q} (\chi_1 \chi)(m) = \sum_{\chi \bmod q} \chi(m), \quad \forall m \in \mathbb{Z}. \tag{1.48}$$



This leads us to the following counterpart of (1.44):

$$\sum_{\chi \bmod q} \chi(m) = \begin{cases} \varphi(q) & \text{for } m \equiv 1 \bmod q, \\ 0 & \text{for } m \not\equiv 1 \bmod q. \end{cases} \tag{1.49}$$

For, when $m \equiv 1 \bmod q$ this assertion follows from (1.40) and (1.47),
and for $(m, q) > 1$ it is trivially true in view of (1.39). In the remaining
case

$$m \not\equiv 1 \bmod q, \quad (m, q) = 1, \tag{1.50}$$

we need the result that, for any $m$ subject to (1.50), there is a character
$\chi_1 \bmod q$ such that $\chi_1(m) \ne 1$, and then the result follows from (1.48).
Next we note the following equivalence:

$$\chi(n)\chi_1(n) = \chi_0(n)\ \ \forall n \iff \chi_1(n) = \bar\chi(n)\ \ \forall n. \tag{1.51}$$

This is clear for $(n, q) > 1$. For $(n, q) = 1$ we multiply on the left by
$\bar\chi(n)$ and use (1.42) and (1.43) (the latter implies always that $\chi\chi_0 = \chi$),
and this step can be reversed because $\chi(n) \ne 0$. Since $\chi\bar\chi_1$ is a character,
it may be used in (1.44), and because of (1.37) and (1.51) we obtain

$${\sum_{\ell=1}^{q}}' \chi(\ell)\bar\chi_1(\ell) = \begin{cases} \varphi(q) & \text{for } \chi = \chi_1, \\ 0 & \text{for } \chi \ne \chi_1. \end{cases} \tag{1.52}$$



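Finally we prove that

$$\sum_{\chi \bmod q} \bar\chi(\ell)\chi(n) = \begin{cases} \varphi(q) & \text{for } n \equiv \ell \bmod q, \\ 0 & \text{for } n \not\equiv \ell \bmod q, \end{cases} \quad \text{if } (\ell, q) = 1. \tag{1.53}$$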



In view of (1.39) we may assume that $(n, q) = 1$. Since $(\ell, q) = 1$,
we can determine $\ell'$ such that

$$(\ell', q) = 1, \quad \ell\ell' \equiv 1 \bmod q, \tag{1.54}$$

and hence by (1.40) and (1.37) one has $\chi(\ell)\chi(\ell') = 1$, which yields after
multiplication by $\bar\chi(\ell)$, in view of (1.42), that $\bar\chi(\ell) = \chi(\ell')$, so that, by
using (1.37) again, there follows

$$\sum_{\chi \bmod q} \bar\chi(\ell)\chi(n) = \sum_{\chi \bmod q} \chi(\ell' n). \tag{1.55}$$

On the other hand, by the definition of $\ell'$ we have

$$\ell' n \equiv 1 \bmod q \iff n \equiv \ell \bmod q. \tag{1.56}$$

Hence (1.53) follows from (1.49) with $m = \ell' n$, using (1.55) and
(1.56).



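From (1.53) we now derive the following analogue of (1.27):

$$\frac{1}{\varphi(q)} \sum_{\chi \bmod q} \Big| \sum_{\ell=1}^{q} \chi(\ell) u_\ell \Big|^2 = {\sum_{\ell=1}^{q}}' |u_\ell|^2, \quad \forall u_\ell \in \mathbb{C}. \tag{1.57}$$

Keeping (1.39) in mind, we see that the left-hand side equals

$$\frac{1}{\varphi(q)} \sum_{\chi \bmod q} {\sum_{\ell, n = 1}^{q}}' \chi(\ell)\bar\chi(n) u_\ell \bar{u}_n = {\sum_{\ell, n = 1}^{q}}' u_\ell \bar{u}_n \Big( \frac{1}{\varphi(q)} \sum_{\chi \bmod q} \chi(\ell)\bar\chi(n) \Big), \tag{1.58}$$

so that (1.57) follows from (1.53).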

If $d \mid q$ and $\chi_1$ is a character mod $d$, then

$$\chi(m) := \begin{cases} \chi_1(m) & \text{for } (m, q) = 1, \\ 0 & \text{for } (m, q) > 1, \end{cases} \tag{1.59}$$

is a character mod $q$, and we say that $\chi_1 \bmod d$ induces the character
$\chi \bmod q$. If $\chi \bmod q$ is not induced by any character $\chi_1 \bmod d$ for any
$d < q$, then $\chi$ is called a primitive character mod $q$. The smallest $f$, $f \mid q$,
such that a $\chi^* \bmod f$ induces $\chi \bmod q$ is called the conductor of $\chi$.
For any character $\chi \bmod q$ the 'Gaussian sum' is defined by

$$\tau(\chi) := \sum_{\ell=1}^{q} \chi(\ell)\, e\!\left(\frac{\ell}{q}\right) = \sum_{\ell \bmod q} \chi(\ell)\, e\!\left(\frac{\ell}{q}\right), \tag{1.60}$$

since both $\chi(\ell)$ and $e(\ell/q)$ are of period $q$. By (1.39), $\ell$ runs actually
through a reduced system of residues mod $q$, and so does $n\ell$ also if
$(n, q) = 1$. Therefore it follows, by using (1.60) for $\bar\chi$, (1.37) and (1.42),
that

$$\chi(n)\tau(\bar\chi) = \sum_{\ell \bmod q} \bar\chi(\ell)\, e\!\left(n\frac{\ell}{q}\right), \quad \text{for any } \chi \bmod q \text{ and } (n, q) = 1. \tag{1.61}$$

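It requires a little more effort to prove that for primitive characters $\chi$
(1.61) holds even without the restriction $(n, q) = 1$, i.e.,

$$\chi(n)\tau(\bar\chi) = \sum_{\ell \bmod q} \bar\chi(\ell)\, e\!\left(n\frac{\ell}{q}\right), \quad \text{for primitive } \chi \bmod q,\ n \in \mathbb{Z}. \tag{1.62}$$

If we take $a_\ell = \bar\chi(\ell)$ in (1.27), it follows from (1.62) that for any
primitive character $\chi \bmod q$

$$\frac{1}{q} \sum_{n=1}^{q} |\tau(\bar\chi)\chi(n)|^2 = \varphi(q) \tag{1.63}$$

or

$$|\tau(\chi)|^2 = q \quad \text{for primitive } \chi \bmod q. \tag{1.64}$$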
If $\chi$ is not a primitive character, let it be induced by

$$\chi^* \bmod f, \quad q = rf, \tag{1.65}$$

where $f$ is the conductor of $\chi$. It can be shown that

$$\tau(\chi) = 0 \quad \text{if } (r, f) > 1 \tag{1.66}$$

and that

$$|\tau(\chi)|^2 = \mu^2(r)\,|\tau(\chi^*)|^2 = \mu^2(r) f, \quad \text{if } (r, f) = 1, \tag{1.67}$$

by (1.64), since $\chi^*$ is a primitive character mod $f$. Collecting together
the results (1.64), (1.66) and (1.67), we have, for the Gaussian sums, the
following

Lemma 1.1. If $f$ is the conductor of $\chi \bmod q$, then

$$q = rf \tag{1.68}$$

and

$$|\tau(\chi)|^2 = \begin{cases} \mu^2(r) f & \text{for } (r, f) = 1, \\ 0 & \text{otherwise.} \end{cases} \tag{1.69}$$
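
Lemma 1.1 can be checked numerically in two easy special cases. The sketch below is an illustration under stated assumptions: it uses the Legendre symbol modulo an odd prime p (a primitive character mod p, so f = p and r = 1) and the principal character mod q (conductor f = 1, r = q). The helper names are hypothetical and chosen for this example only.

```python
import cmath
from math import gcd


def e(u):
    """e(u) = exp(2*pi*i*u), the notation of (1.21)."""
    return cmath.exp(2j * cmath.pi * u)


def legendre_character(p):
    """Legendre symbol mod an odd prime p (via Euler's criterion), as a function of l."""
    def chi(l):
        l %= p
        if l == 0:
            return 0
        return 1 if pow(l, (p - 1) // 2, p) == 1 else -1
    return chi


def principal_character(q):
    """Principal character chi_0 mod q as in (1.43)."""
    return lambda l: 1 if gcd(l, q) == 1 else 0


def gauss_sum(chi, q):
    """Gaussian sum tau(chi) as in (1.60)."""
    return sum(chi(l) * e(l / q) for l in range(1, q + 1))
```

With these definitions, `abs(gauss_sum(legendre_character(p), p)) ** 2` should be close to p, in line with (1.64), while `abs(gauss_sum(principal_character(q), q)) ** 2` should be close to 1 for squarefree q and to 0 otherwise, in line with (1.69) for f = 1.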



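We close this chapter with another application which demonstrates
the usefulness of our characters.

Lemma 1.2. For any $a_n$'s in $\mathbb{C}$ and for any character $\chi \bmod q$ put

$$S(x; q, \ell) := \sum_{\substack{n \le x \\ n \equiv \ell \bmod q}} a_n, \quad S(x, \chi) := \sum_{n \le x} a_n \chi(n). \tag{1.70}$$

Then

$${\sum_{\ell=1}^{q}}' \Big| S(x; q, \ell) - \frac{S(x, \chi_0)}{\varphi(q)} \Big|^2 = \frac{1}{\varphi(q)} \sum_{\chi \ne \chi_0} |S(x, \chi)|^2. \tag{1.71}$$

Proof. First we note that, by (1.53) (with $\chi$ being replaced by $\chi_1$, followed
by taking the complex conjugate) and (1.52), we have

$${\sum_{\ell=1}^{q}}' \chi(\ell) S(x; q, \ell) = {\sum_{\ell=1}^{q}}' \chi(\ell) \sum_{n \le x} a_n \frac{1}{\varphi(q)} \sum_{\chi_1 \bmod q} \chi_1(\ell)\bar\chi_1(n) = \sum_{n \le x} a_n \frac{1}{\varphi(q)} \sum_{\chi_1 \bmod q} \bar\chi_1(n) {\sum_{\ell=1}^{q}}' \chi(\ell)\chi_1(\ell) = S(x, \chi). \tag{1.72}$$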
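Next, taking $u_\ell = S(x; q, \ell) - \frac{S(x, \chi_0)}{\varphi(q)}$ in (1.57), it follows that

$${\sum_{\ell=1}^{q}}' \Big| S(x; q, \ell) - \frac{S(x, \chi_0)}{\varphi(q)} \Big|^2 = \frac{1}{\varphi(q)} \sum_{\chi \bmod q} \Big| {\sum_{\ell=1}^{q}}' \chi(\ell) \Big( S(x; q, \ell) - \frac{S(x, \chi_0)}{\varphi(q)} \Big) \Big|^2. \tag{1.73}$$

According to (1.72) and (1.44), we have

$${\sum_{\ell=1}^{q}}' \chi(\ell) \Big( S(x; q, \ell) - \frac{S(x, \chi_0)}{\varphi(q)} \Big) = \begin{cases} 0 & \text{for } \chi = \chi_0, \\ S(x, \chi) & \text{for } \chi \ne \chi_0, \end{cases} \tag{1.74}$$

and using this in (1.73) we obtain (1.71).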

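For the theory of prime numbers the important special case

$$\psi(x; q, \ell) := \sum_{\substack{n \le x \\ n \equiv \ell \bmod q}} \Lambda(n), \quad \psi(x, \chi) := \sum_{n \le x} \Lambda(n)\chi(n) \tag{1.75}$$

yields, on noting that $\Lambda(n)$ is a real-valued function, the identity

$${\sum_{\ell=1}^{q}}' \Big( \psi(x; q, \ell) - \frac{\psi(x, \chi_0)}{\varphi(q)} \Big)^2 = \frac{1}{\varphi(q)} \sum_{\chi \ne \chi_0} |\psi(x, \chi)|^2. \tag{1.76}$$

This formula was the starting point in the proof of the mean-value
theorem of Davenport and Halberstam [2] (cf. (6.34)).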

NOTES 

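(1.27): As another example, take in (1.27)

$$a_\ell = \begin{cases} 1 & \text{for } (\ell, q) = 1, \\ 0 & \text{for } (\ell, q) > 1. \end{cases}$$

Then, in view of (1.29),

$$\sum_{n=1}^{q} |c_q(n)|^2 = q\,\varphi(q). \tag{1.77}$$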

3: For the proofs omitted in this section we refer, for instance, to 
Davenport [1] (Chapter© and Huxley [7] (Chapter©. 



Chapter 2 

The Large Sieve 



WE START by recalling the notation introduced in Chapter 0 (cf. (0.…)
and (0.52)). Let $\mathcal{Y}$ be a set consisting of $S$ integers from an interval of
length $N$:

$$\mathcal{Y} \subset (M, M + N], \quad M \in \mathbb{Z},\ N \in \mathbb{N},\ S := |\mathcal{Y}|, \tag{2.1}$$

and let

$$T(x) := \sum_{n \in \mathcal{Y}} e(nx). \tag{2.2}$$

Then, as has been indicated in Chapter 0, the large sieve in its basic
form is concerned with the estimation of the expression




(2.3) 



in terms of N, Q and S , of the form 




(2.4) 



The simplest approach to a bound of the type in (12.4ft is now due to 
Gallagher Q. Gallagher's starting point is the following lemma which 
occurs in the earlier work of Hardy and Littlewood, and of Sobolev (for 
several variables): 

29 
Lemma 2.1. Let $u$ and $\delta$ ($> 0$) be real numbers, and let $f(x)$ (complex-valued)
be continuous on $[u - \frac{\delta}{2}, u + \frac{\delta}{2}]$ with a continuous derivative in
$(u - \frac{\delta}{2}, u + \frac{\delta}{2})$. Then

$$|f(u)|^2 \le \int_{u - \delta/2}^{u + \delta/2} |f(x) f'(x)|\,dx + \delta^{-1} \int_{u - \delta/2}^{u + \delta/2} |f(x)|^2\,dx. \tag{2.5}$$

Proof. Put

$$F(x) = f^2(x). \tag{2.6}$$

Then, by partial integration, we find the identity

$$F(u) = \delta^{-1} \int_{u - \delta/2}^{u} \Big( x - \big(u - \tfrac{\delta}{2}\big) \Big) F'(x)\,dx + \delta^{-1} \int_{u}^{u + \delta/2} \Big( x - \big(u + \tfrac{\delta}{2}\big) \Big) F'(x)\,dx + \delta^{-1} \int_{u - \delta/2}^{u + \delta/2} F(x)\,dx. \tag{2.7}$$

Hence

$$|F(u)| \le \frac{1}{2} \int_{u - \delta/2}^{u + \delta/2} |F'(x)|\,dx + \delta^{-1} \int_{u - \delta/2}^{u + \delta/2} |F(x)|\,dx, \tag{2.8}$$

which gives (2.5) on using (2.6). □

Gallagher's use of the above lemma enables one to obtain a bound of
the type in (2.4) when the expression on the left there is extended in two
directions; namely, when the terms of the sum in (2.2) have weights
(arbitrary complex numbers) $a_n$ and the set of (fractional) points $\frac{\ell}{q}$ in (2.4)
is replaced by a finite set of real numbers which are distinct modulo 1.
Theorem 2.1. For any complex numbers $a_n$, $M < n \le M + N$, set

$$U(x) := \sum_{M < n \le M + N} a_n e(nx). \tag{2.9}$$

Let $x_1, \dots, x_R$ be real numbers which are distinct mod 1 and put

$$\delta := \min_{r \ne s} \|x_r - x_s\| \ \text{ if } R \ge 2, \quad \|x\| := \min_{k \in \mathbb{Z}} |x - k|; \quad \delta := \infty \ \text{ if } R = 1. \tag{2.10}$$

Then

$$\sum_{r=1}^{R} |U(x_r)|^2 \le (\pi N + \delta^{-1}) \sum_{M < n \le M + N} |a_n|^2. \tag{2.11}$$

We shall present the proof of this theorem in full detail. For some
proofs of large sieve inequalities, in particular for the proof of Theorem
2.1, which is based on Lemma 2.1 (cf. also the proof of (2.95)), it is
advantageous (cf. (2.30)) to consider a shifted interval (usually, an
interval symmetric about the point zero) instead of the interval
$(M, M + N]$. Then the general case is easily derived by reversing the shifting
procedure. The idea of the last step is contained in the following

Lemma 2.2. Let $N \in \mathbb{N}$, let $x_1, \dots, x_R$ ($R \ge 2$) be real numbers which
are distinct mod 1 and put

$$\delta := \min_{r \ne s} \|x_r - x_s\|. \tag{2.12}$$

Set

$$V(x) := \sum_{-N/2 < m \le N/2} b_m e(mx), \quad b_m \in \mathbb{C}. \tag{2.13}$$

Then the inequality

$$\sum_{r=1}^{R} |V(x_r)|^2 \le \Delta(N, \delta) \sum_{-N/2 < m \le N/2} |b_m|^2, \quad \forall b_m \in \mathbb{C} \tag{2.14}$$

with some (positive) function $\Delta(N, \delta)$ depending only on $N$ and $\delta$ implies
that for any $M \in \mathbb{Z}$,

$$\sum_{r=1}^{R} |U(x_r)|^2 \le \Delta(N, \delta) \sum_{M < n \le M + N} |a_n|^2, \quad \forall a_n \in \mathbb{C} \tag{2.15}$$

where

$$U(x) := \sum_{M < n \le M + N} a_n e(nx). \tag{2.16}$$

Proof of Lemma 2.2. If $U(x)$ is defined by (2.16), let us consider

$$V(x) := e\Big( -\big(M + \big[\tfrac{N+1}{2}\big]\big) x \Big) U(x) = \sum_{M < n \le M + N} a_n\, e\Big( \big(n - M - \big[\tfrac{N+1}{2}\big]\big) x \Big) = \sum_{-[\frac{N+1}{2}] < m \le -[\frac{N+1}{2}] + N} b_m e(mx) = \sum_{-N/2 < m \le N/2} b_m e(mx), \tag{2.17}$$

where

$$b_m = a_{m + M + [\frac{N+1}{2}]}, \quad -\frac{N}{2} < m \le \frac{N}{2}. \tag{2.18}$$

Now $|U(x)| = |V(x)|$ for all real $x$, from which we easily see that
(2.15) is an immediate consequence of (2.14).

Now we are in a position to prove Theorem 2.1.

Proof of Theorem 2.1. To start with we dispose of the case $R = 1$.
This is easily done in view of (2.10) and Cauchy's inequality:

$$|U(x_1)|^2 \le N \sum_{M < n \le M + N} |a_n|^2. \tag{2.19}$$

So we assume that

$$R \ge 2 \tag{2.20}$$

and further, because of Lemma 2.2, we shall consider $V$ instead of $U$.
Since $V(x)$ is of period 1 we can also suppose that

$$0 < x_1 < x_2 < \dots < x_R \le 1. \tag{2.21}$$

By the pigeon-hole principle it follows easily that

$$\delta \le \frac{1}{R}. \tag{2.22}$$

Now, from Lemma 2.1 with $f = V$ and $u = x_r$, one has

$$|V(x_r)|^2 \le \int_{I_r} |V(x) V'(x)|\,dx + \delta^{-1} \int_{I_r} |V(x)|^2\,dx, \tag{2.23}$$
where

$$I_r := \Big[ x_r - \frac{\delta}{2},\ x_r + \frac{\delta}{2} \Big]. \tag{2.24}$$

By (2.12), we have for $r \ne s$

$$\delta \le \|x_r - x_s\| \le |x_r - x_s|, \tag{2.25}$$

so that our intervals $I_r$ do not overlap and their total length, i.e., the length
of

$$\bigcup_{r=1}^{R} I_r, \tag{2.26}$$

equals

$$\delta R \le 1, \tag{2.27}$$

on recalling (2.22). Summing now over $r$ in (2.23) (and using that both
$V(x)$ and $V'(x)$ have period 1) we can replace the integration on the right
over (2.26) by $\int_0^1$. Thus we get, by employing Schwarz's inequality,

$$\sum_{r=1}^{R} |V(x_r)|^2 \le \Big( \int_0^1 |V(x)|^2\,dx \Big)^{1/2} \Big( \int_0^1 |V'(x)|^2\,dx \Big)^{1/2} + \delta^{-1} \int_0^1 |V(x)|^2\,dx. \tag{2.28}$$





Now, it follows from (1.28) that

$$\int_0^1 |V(x)|^2\,dx = \sum_{-N/2 < m \le N/2} |b_m|^2 \tag{2.29}$$

and also

$$\int_0^1 |V'(x)|^2\,dx = \sum_{-N/2 < m \le N/2} |2\pi i\, m\, b_m|^2 \le \pi^2 N^2 \sum_{-N/2 < m \le N/2} |b_m|^2. \tag{2.30}$$

Hence (2.28) yields

$$\sum_{r=1}^{R} |V(x_r)|^2 \le (\pi N + \delta^{-1}) \sum_{-N/2 < m \le N/2} |b_m|^2. \tag{2.31}$$

This is (2.14) with

$$\Delta(N, \delta) = \pi N + \delta^{-1}. \tag{2.32}$$

Therefore, by Lemma 2.2, (2.15) with (2.32) gives (2.11), thereby
completing the proof of Theorem 2.1.

Now we discuss the result of Theorem 2.1. Due to the presence of
the factor $\delta^{-1}$ the efficiency of (2.11), as of other large sieve inequalities
that we shall consider, depends on the information as to how 'well-spaced'
(in the sense of (2.10)) the points $x_r$ are.

The simplest case is, for any $(2 \le) R \in \mathbb{N}$, with

$$x_r = \frac{r}{R}, \quad 1 \le r \le R, \tag{2.33}$$

so that

$$\delta = \frac{1}{R}. \tag{2.34}$$

Now Theorem 2.1 gives

$$\sum_{r=1}^{R} \Big| U\Big(\frac{r}{R}\Big) \Big|^2 \le (\pi N + R) \sum_{M < n \le M + N} |a_n|^2. \tag{2.35}$$

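In the more interesting case

$$\frac{\ell}{q}, \quad 1 \le \ell \le q \le Q,\ (\ell, q) = 1, \tag{2.36}$$

the set of Farey fractions of order $Q$, the points are not quite so well-spaced.
We find, on assuming that $Q \ge 2$, for any two distinct Farey
fractions in (2.36)

$$\Big\| \frac{\ell}{q} - \frac{\ell'}{q'} \Big\| = \Big\| \frac{\ell q' - q \ell'}{q q'} \Big\| \ge \frac{1}{q q'} \ge \frac{1}{Q^2}, \tag{2.37}$$

i.e.

$$\delta \ge \frac{1}{Q^2}. \tag{2.38}$$

Hence we get from Theorem 2.1 and (2.19) the following

Theorem 2.2. Let $M \in \mathbb{Z}$, $N \in \mathbb{N}$ and let $a_n$ ($M < n \le M + N$) be
arbitrary complex numbers. Then

$$\sum_{q \le Q} {\sum_{\ell=1}^{q}}' \Big| U\Big(\frac{\ell}{q}\Big) \Big|^2 \le (\pi N + Q^2) \sum_{M < n \le M + N} |a_n|^2, \quad \forall Q \in \mathbb{N}, \tag{2.39}$$

where $U(x)$ is defined through (2.9).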
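
To make the statement of Theorem 2.2 concrete, the following Python sketch evaluates both sides of (2.39) for a given finite list of coefficients. It is only a numerical illustration, not part of the proof; the function names and the convention that the list `a` holds $a_{M+1}, \dots, a_{M+N}$ are assumptions of this example.

```python
import cmath
from math import gcd, pi


def large_sieve_lhs(a, M, Q):
    """Left-hand side of (2.39).

    `a` is the list [a_{M+1}, ..., a_{M+N}]; the sum runs over all Farey
    fractions l/q with 1 <= l <= q <= Q and (l, q) = 1.
    """
    total = 0.0
    for q in range(1, Q + 1):
        for l in range(1, q + 1):
            if gcd(l, q) == 1:
                # U(l/q) = sum of a_n * e(n*l/q) over M < n <= M + N
                U = sum(c * cmath.exp(2j * pi * (M + 1 + k) * l / q)
                        for k, c in enumerate(a))
                total += abs(U) ** 2
    return total


def large_sieve_rhs(a, Q):
    """Right-hand side of (2.39): (pi*N + Q^2) times the sum of |a_n|^2."""
    N = len(a)
    return (pi * N + Q * Q) * sum(abs(c) ** 2 for c in a)
```

For any choice of `a`, `M` and `Q`, the value of `large_sieve_lhs(a, M, Q)` should not exceed `large_sieve_rhs(a, Q)` (up to rounding), and comparing the two gives a feeling for how much room the factor $\pi N + Q^2$ leaves in particular cases.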

Regarding the quality of the preceding results we make the following
remarks. Under the assumptions of Theorem 2.1 recall the estimation
(2.15):

$$\sum_{r=1}^{R} |U(x_r)|^2 \le \Delta(N, \delta) \sum_{M < n \le M + N} |a_n|^2. \tag{2.40}$$

In some applications the bound in (2.11) is quite satisfactory. However,
if we are interested in better estimates we need to apply more effective
tools in the proof.

In order to see some necessary conditions for having a general result
of the form (2.40), first note that if $x_1 = 0$ and $a_n = 1$ for $M < n \le M + N$
then the left-hand side is at least $|U(x_1)|^2 = N^2$, so that

$$\Delta(N, \delta) \ge N. \tag{2.41}$$

Next, since $\delta$ is invariant under a translation of the set $x_1, \dots, x_R$ by
any given $x \in \mathbb{R}$, (2.40) would also imply that

$$\sum_{r=1}^{R} |U(x_r + x)|^2 \le \Delta(N, \delta) \sum_{M < n \le M + N} |a_n|^2 \quad \text{for every } x \in \mathbb{R}. \tag{2.42}$$

Integrating this with respect to $x$ over an interval of length 1 and
using (1.28), we see that

$$R \sum_{M < n \le M + N} |a_n|^2 = \sum_{r=1}^{R} \int_0^1 |U(x_r + x)|^2\,dx \le \Delta(N, \delta) \sum_{M < n \le M + N} |a_n|^2. \tag{2.43}$$

Therefore from our example (2.33) of equally-spaced points it follows
that

$$\Delta(N, \delta) \ge \delta^{-1}. \tag{2.44}$$

Furthermore, Bombieri and Davenport [3] have given examples from
which one gets

$$\Delta(N, \delta) \ge N + \delta^{-1} - 1. \tag{2.45}$$

These remarks can be considered as negative ones.

In the positive direction the following result, due to Montgomery
and Vaughan [2], leaves only a small gap when compared with (2.45).
Since $N$ is usually large in applications this difference is minor.

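Theorem 2.3. Under the hypotheses of Theorem 2.1 we have

$$\sum_{r=1}^{R} |U(x_r)|^2 \le (N + \delta^{-1}) \sum_{M < n \le M + N} |a_n|^2. \tag{2.46}$$

(In what follows we assume that $R \ge 2$ (cf. (2.20)).)

Their proof uses the principle of duality due to Hellinger-Toeplitz,
namely that for an $R \times N$ matrix $(c_{rn})$ with complex entries and a constant
$\Delta$

$$\sum_{n=1}^{N} \Big| \sum_{r=1}^{R} c_{rn} v_r \Big|^2 \le \Delta \sum_{r=1}^{R} |v_r|^2, \quad \forall v_r \in \mathbb{C} \tag{2.47}$$

implies that (and is implied by)

$$\sum_{r=1}^{R} \Big| \sum_{n=1}^{N} c_{rn} w_n \Big|^2 \le \Delta \sum_{n=1}^{N} |w_n|^2, \quad \forall w_n \in \mathbb{C} \tag{2.48}$$

and the following extension of Schur's result regarding Hilbert's
inequality (Montgomery and Vaughan [1]):

Lemma 2.3. Under the hypotheses of Theorem 2.1 we have

$$\Big| \sum_{r=1}^{R} \sum_{\substack{s=1 \\ s \ne r}}^{R} u_r \bar{u}_s \operatorname{cosec}(\pi(x_r - x_s)) \Big| \le \delta^{-1} \sum_{r=1}^{R} |u_r|^2, \quad \forall u_r \in \mathbb{C}. \tag{2.49}$$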




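Proof of Lemma 2.3. First we can impose the normalization condition

$$\sum_{r=1}^{R} |u_r|^2 = 1. \tag{2.50}$$

Also we can assume, since the double-sum in (2.49) is a skew-hermitian
form, that the $(u_r)$ which makes the left-hand side there maximum
satisfies

$$\sum_{\substack{r=1 \\ r \ne s}}^{R} u_r \operatorname{cosec}(\pi(x_r - x_s)) = i\lambda u_s, \quad 1 \le s \le R \tag{2.51}$$

with some (real) $\lambda$. Thus it suffices to show (under (2.50) and (2.51))
that

$$|\lambda| \le \delta^{-1}. \tag{2.52}$$

Further, we can assume that all the $x_r$'s lie in the interval $(0, 1]$
(without any loss of generality).

We have, by (2.51) and (2.50),

$$|\lambda|^2 = \sum_{s=1}^{R} \Big| \sum_{\substack{r=1 \\ r \ne s}}^{R} u_r \operatorname{cosec}(\pi(x_r - x_s)) \Big|^2 = \sum_{s=1}^{R} \sum_{\substack{r=1 \\ r \ne s}}^{R} \sum_{\substack{t=1 \\ t \ne s}}^{R} u_r \bar{u}_t \operatorname{cosec}(\pi(x_r - x_s)) \operatorname{cosec}(\pi(x_t - x_s)) = \Sigma_1 + \Sigma_2, \tag{2.53}$$

where

$$\Sigma_1 = \sum_{s=1}^{R} \sum_{\substack{r=1 \\ r \ne s}}^{R} |u_r|^2 \operatorname{cosec}^2(\pi(x_r - x_s))$$

and

$$\Sigma_2 = \sum_{r=1}^{R} \sum_{\substack{t=1 \\ t \ne r}}^{R} u_r \bar{u}_t \sum_{\substack{s=1 \\ s \ne r,\ s \ne t}}^{R} \operatorname{cosec}(\pi(x_r - x_s)) \operatorname{cosec}(\pi(x_t - x_s)). \tag{2.54}$$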
(2-54) 
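Using the identity (if $R \ge 3$; for $R = 2$ the inequality (2.63) below is
trivial, because then $\Sigma_2 = 0$)

$$\operatorname{cosec}\theta_1 \operatorname{cosec}\theta_2 = \operatorname{cosec}(\theta_1 - \theta_2)(\cot\theta_2 - \cot\theta_1), \quad \text{if } \theta_1 \theta_2 (\theta_1 - \theta_2) \ne 0,\ -\pi < \theta_1, \theta_2, \theta_1 - \theta_2 < \pi,$$

we see that

$$\Sigma_2 = \Sigma_3 - \Sigma_4, \tag{2.55}$$

where

$$\Sigma_3 = \sum_{r=1}^{R} \sum_{\substack{t=1 \\ t \ne r}}^{R} u_r \bar{u}_t \operatorname{cosec}(\pi(x_r - x_t)) \sum_{\substack{s=1 \\ s \ne r,\ s \ne t}}^{R} \cot(\pi(x_t - x_s))$$

and

$$\Sigma_4 = \sum_{r=1}^{R} \sum_{\substack{t=1 \\ t \ne r}}^{R} u_r \bar{u}_t \operatorname{cosec}(\pi(x_r - x_t)) \sum_{\substack{s=1 \\ s \ne r,\ s \ne t}}^{R} \cot(\pi(x_r - x_s)). \tag{2.56}$$

Denoting

$$\xi_r := \sum_{\substack{s=1 \\ s \ne r}}^{R} \cot(\pi(x_r - x_s)), \quad 1 \le r \le R, \tag{2.57}$$

we will have

$$\Sigma_3 = \sum_{r=1}^{R} \sum_{\substack{t=1 \\ t \ne r}}^{R} u_r \bar{u}_t \operatorname{cosec}(\pi(x_r - x_t))\,\xi_t - \Sigma_{31} \tag{2.58}$$

with

$$\Sigma_{31} = \sum_{r=1}^{R} \sum_{\substack{t=1 \\ t \ne r}}^{R} u_r \bar{u}_t \operatorname{cosec}(\pi(x_r - x_t)) \cot(\pi(x_t - x_r)) = \operatorname{Re} \Sigma_{31}. \tag{2.59}$$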



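Now, from (2.58) and (2.51), we obtain

$$\Sigma_3 + \Sigma_{31} = \sum_{t=1}^{R} \bar{u}_t\, \xi_t\, i\lambda u_t = i\lambda \sum_{t=1}^{R} \xi_t |u_t|^2. \tag{2.60}$$

Treating $\Sigma_4$ of (2.56) similarly, we also get

$$\Sigma_3 + \Sigma_{31} = \Sigma_4 + \Sigma_{41}, \quad \Sigma_{41} = -\Sigma_{31}, \tag{2.61}$$

so that, from (2.55) and (2.59), there follows

$$\Sigma_2 = -2 \operatorname{Re} \Sigma_{31}. \tag{2.62}$$

Hence

$$|\Sigma_2| = 2|\Sigma_{31}| \le \sum_{r=1}^{R} \sum_{\substack{t=1 \\ t \ne r}}^{R} \big( |u_r|^2 + |u_t|^2 \big) \operatorname{cosec}^2(\pi(x_r - x_t))\, |\cos(\pi(x_r - x_t))|, \tag{2.63}$$

which yields in view of symmetry, by (2.53) and (2.54),

$$|\lambda|^2 \le \sum_{r=1}^{R} |u_r|^2 \sum_{\substack{t=1 \\ t \ne r}}^{R} \operatorname{cosec}^2(\pi(x_r - x_t)) \big( 1 + 2|\cos(\pi(x_r - x_t))| \big). \tag{2.64}$$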

Observing here that $\sin^2(\pi\theta) = \sin^2(\pi\|\theta\|)$ and $|\cos(\pi\theta)| =
|\cos(\pi\|\theta\|)|$ (for any real $\theta$) and employing the inequality

$$\operatorname{cosec}^2\theta\,(1 + 2\cos\theta) \le 3\theta^{-2} \quad \text{for } 0 < \theta \le \frac{\pi}{2}, \tag{2.65}$$

we obtain further

$$|\lambda|^2 \le 3\pi^{-2} \sum_{r=1}^{R} |u_r|^2 \sum_{\substack{t=1 \\ t \ne r}}^{R} \|x_r - x_t\|^{-2}. \tag{2.66}$$

Since the value of the inner sum here is unaltered when the $x$'s are
translated by integers, we can arrange, for any given $x_r$, translates of all $x_t$'s
($t \ne r$) to lie in the interval $(x_r - \frac{1}{2}, x_r + \frac{1}{2}]$. Then the inner sum in (2.66)
is easily majorized, because of (2.10), by $2 \sum_{j=1}^{\infty} (\delta j)^{-2}$. So (on recalling
(2.50)) there holds

$$|\lambda|^2 \le 6\pi^{-2}\delta^{-2} \sum_{j=1}^{\infty} j^{-2} = \delta^{-2}. \tag{2.67}$$

This is (2.52) and thereby Lemma 2.3 is completely proved.

Now we are in a position to deduce Theorem 2.3 from (2.47) to
(2.49).

Proof of Theorem 2.3. Taking

$$c_{rn} = e((M + n)x_r), \quad w_n = a_{M+n}, \tag{2.68}$$

it suffices for a proof of (2.46), in view of (2.47) and (2.48), to show
that

$$\sum_{n=1}^{N} \Big| \sum_{r=1}^{R} c_{rn} v_r \Big|^2 \le (N + \delta^{-1}) \sum_{r=1}^{R} |v_r|^2, \quad \forall v_r \in \mathbb{C}. \tag{2.69}$$

Expanding the left-hand side here we obtain from the diagonal terms
the contribution

$$N \sum_{r=1}^{R} |v_r|^2. \tag{2.70}$$

The remaining part amounts to

$$\sum_{r=1}^{R} \sum_{\substack{s=1 \\ s \ne r}}^{R} v_r \bar{v}_s \sum_{M < n \le M + N} e(n(x_r - x_s)), \tag{2.71}$$

and the inner sum here is

$$\frac{1}{2}\, i \Big\{ e\big( (M + \tfrac{1}{2})(x_r - x_s) \big) - e\big( (M + N + \tfrac{1}{2})(x_r - x_s) \big) \Big\} \operatorname{cosec}(\pi(x_r - x_s)). \tag{2.72}$$

We apply (2.49) twice with the choices

$$u_r = v_r\, e\big( (M + \tfrac{1}{2}) x_r \big) \quad \text{and} \quad u_r = v_r\, e\big( (M + N + \tfrac{1}{2}) x_r \big). \tag{2.73}$$

Then, because of the factor $\frac{1}{2}$ in (2.72), we obtain that the contribution
(2.71) is

$$\le \delta^{-1} \sum_{r=1}^{R} |v_r|^2, \tag{2.74}$$

and this in combination with (2.70) yields (2.69).

Montgomery and Vaughan [2] have also obtained a more sophisticated
form of the large sieve, which has turned out to be extremely
powerful in arithmetical applications. The weights attached here enable
one to take care of the irregular spacing of Farey fractions (cf. Theorems
2.5 and 2.6 below).

Theorem 2.4. Under the assumptions of Theorem 2.1 put

$$\delta_r := \min_{s \ne r} \|x_r - x_s\|. \tag{2.75}$$

Then

$$\sum_{r=1}^{R} \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1} |U(x_r)|^2 \le \sum_{M < n \le M + N} |a_n|^2. \tag{2.76}$$

The proof is very similar to that of Theorem 2.3. The essential
change is the following version, involving the $\delta_r$'s, of Lemma 2.3
(Montgomery and Vaughan [1]):

Lemma 2.4. Under the hypotheses and notation of Theorem 2.4 there
holds

$$\Big| \sum_{r=1}^{R} \sum_{\substack{s=1 \\ s \ne r}}^{R} u_r \bar{u}_s \operatorname{cosec}(\pi(x_r - x_s)) \Big| \le \frac{3}{2} \sum_{r=1}^{R} |u_r|^2 \delta_r^{-1}, \quad \forall u_r \in \mathbb{C}. \tag{2.77}$$

Here we shall only conclude Theorem 2.4 using (2.47), (2.48) and (2.77).

Proof of Theorem 2.4. Now we put, instead of (2.68),

$$c_{rn} = \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1/2} e((M + n)x_r), \quad w_n = a_{M+n}. \tag{2.78}$$

Then the diagonal terms, from the expression on the left of (2.47),
contribute

$$N \sum_{r=1}^{R} \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1} |v_r|^2, \tag{2.79}$$

and the remaining part is

$$\sum_{r=1}^{R} \sum_{\substack{s=1 \\ s \ne r}}^{R} v_r \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1/2} \bar{v}_s \Big( N + \frac{3}{2}\delta_s^{-1} \Big)^{-1/2} \sum_{M < n \le M + N} e(n(x_r - x_s)). \tag{2.80}$$

Again the inner sum is given by (2.72). Now we apply (2.77) twice
with the choices

$$u_r = v_r \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1/2} e\big( (M + \tfrac{1}{2}) x_r \big) \quad \text{and} \quad u_r = v_r \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1/2} e\big( (M + N + \tfrac{1}{2}) x_r \big). \tag{2.81}$$

Then, because of the factor $\frac{1}{2}$ in (2.72), we obtain that the contribution
(2.80) is

$$\le \frac{3}{2} \sum_{r=1}^{R} |v_r|^2 \Big( N + \frac{3}{2}\delta_r^{-1} \Big)^{-1} \delta_r^{-1}. \tag{2.82}$$

Now, this together with (2.79) proves (2.47) with $\Delta = 1$ for our
choice (2.78) of $c_{rn}$. Therefore (2.48) with the above choice (2.78) of
$w_n$ yields (2.76), thereby proving Theorem 2.4.

Let us now specialize again. We assume that $Q \ge 2$, because the
theorems that follow are trivially true for $Q = 1$ as before (cf. (2.19)).
Take

$$x_r = \frac{\ell}{q}, \quad 1 \le \ell \le q \le Q,\ (\ell, q) = 1, \tag{2.83}$$

so that we have (cf. (2.37)), for any two distinct Farey fractions of
order $Q$,

$$\Big\| \frac{\ell}{q} - \frac{\ell'}{q'} \Big\| \ge \frac{1}{q q'} \ge \frac{1}{qQ} \ge \frac{1}{Q^2}, \tag{2.84}$$

which shows that in Theorem 2.3 and Theorem 2.4 the quantities $Q^{-2}$
and $q^{-1}Q^{-1}$ are permissible lower bounds for $\delta$ and $\delta_r$, respectively.
Therefore we obtain from these theorems



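Theorem 2.5. For any complex numbers $a_n$, put

$$U(x) := \sum_{M < n \le M + N} a_n e(nx). \tag{2.85}$$

Then

$$\sum_{q \le Q} {\sum_{\ell=1}^{q}}' \Big| U\Big(\frac{\ell}{q}\Big) \Big|^2 \le (N + Q^2) \sum_{M < n \le M + N} |a_n|^2 \tag{2.86}$$

and

$$\sum_{q \le Q} \Big( N + \frac{3}{2} qQ \Big)^{-1} {\sum_{\ell=1}^{q}}' \Big| U\Big(\frac{\ell}{q}\Big) \Big|^2 \le \sum_{M < n \le M + N} |a_n|^2. \tag{2.87}$$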

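Finally, returning to the beginning of this chapter, i.e.,

$$a_n = \begin{cases} 1 & \text{if } n \in \mathcal{Y}, \\ 0 & \text{if } n \notin \mathcal{Y}, \end{cases} \tag{2.88}$$

where $U(x) = T(x)$, we note that Theorem 2.5 contains, with respect to
(2.4), the following

Theorem 2.6. Let $\mathcal{Y}$ be a set of $S$ integers from an interval $(M, M + N]$
and put

$$T(x) := \sum_{n \in \mathcal{Y}} e(nx). \tag{2.89}$$

Then

$$\sum_{q \le Q} {\sum_{\ell=1}^{q}}' \Big| T\Big(\frac{\ell}{q}\Big) \Big|^2 \le (N + Q^2) S \tag{2.90}$$

and

$$\sum_{q \le Q} \Big( N + \frac{3}{2} qQ \Big)^{-1} {\sum_{\ell=1}^{q}}' \Big| T\Big(\frac{\ell}{q}\Big) \Big|^2 \le S. \tag{2.91}$$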



NOTES 



The (explicit) qualitative version of (2.39) occurs for the first time
in Bombieri [1] with the factor $7\max(N, Q^2)$. His method may be
considered to be a refinement of Linnik's.

An improvement of Bombieri's factor as well as the extension from
Farey fractions to well-spaced points, i.e., (2.40), is due to Davenport
and Halberstam. Their substantial improvement of the method is
based on a convolution of $U(x)$ with a suitable auxiliary function, an
idea introduced by Roth [2]. Subsequent developments and discussions
along these lines were given by Bombieri and Davenport [ 2 ], [ 3 ] (and by 
Liu Q). The paper of Bombieri and Davenport contains also various 
investigations under different assumptions about the relative orders of $N$
and $\delta$. In particular, they have proved the only result that still supersedes,
under certain conditions, Theorem 2.3; namely

$$\Delta(N, \delta) \le \delta^{-1} + 270 N^3 \delta^2 \quad \text{if } N\delta \le \frac{1}{4}, \tag{2.92}$$

which is superior to (2.46) if $(N\delta)^2 < \frac{1}{270}$.

The best general result known to date, Theorem l2.3l is due to Mont- 
gomery and Vaughan [2]. They have pointed out (cf. Montgomery and 
Vaughan [ 1 ]) that it is possible to replace the factor 8~ l in d2.49t . and so 
also in (12.46b . by 6~ l -c for some c > 0. Indeed, from the remarks subse- 

i can be more precisely 



quent to (12.66b we see that the inner sum in (JZf 
estimated as follows: Let J\ and denote the number of translates of 

x' t s which lie in the intervals (x r - -, x r ) and (x r , x r + -] respectively, 
so that J\ + J2 - R — 1 . Then the above-mentioned inner sum is at most 
(cf. dTTfjl ) 



j, 7, 00 
; =1 ; =1 j 2 +i 



(g+i) 



(Ji+1)(7 2 +1) 



— ) < S~ 2 (- —) < —6~ 



(2.93) 



45 on using R > 2, which implies in view of ( 12.221 ) that (R + 1) < -6 l . 



Now, on combining i2. 661) . ( 12.931 ) and (12.501) we obtain 



\a\ 2 < 6~ 2 - -5~ l < or 1 - -). 



(2.94) 



Thus one can take, for instance, c = — ■ in the above remark. How- 

n 1 

ever, according to (12.45b . it is not possible to obtain these general esti- 
mates with any c > 1. The essential tools for the proofs of the Theorem 
2.3 and 2.4 were developed in Montgomery and Vaughan [1] (cf. Montgomery
[6]). Earlier similar approaches had been discussed in the work
of Elliott, Mathews, and Kobayashi.

An intermediate result of Bombieri ([4] and p. 17 of [6]), namely



$$\Delta(N,\delta) \le N + 2\delta^{-1}, \tag{2.95}$$



is based on Theorem 0.1. Under the assumption of Theorem 2.1, considering
the sum $\sum_{n=-N}^{N} a_n e(nx)$ (cf. Lemma 2.2) instead of U(x), he takes
the Hilbert space $\ell^2$ of sequences $\alpha = \{\alpha_n\}$ with $(\alpha, \beta) := \sum_{n=-\infty}^{\infty} \alpha_n \overline{\beta_n}$,
$\|\alpha\|^2 = \sum_{n=-\infty}^{\infty} |\alpha_n|^2$. Choosing



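$$L \in \mathbb{N}, \quad f := \begin{cases} a_n & \text{if } |n| \le N,\\ 0 & \text{if } |n| > N, \end{cases} \qquad \varphi_r := \begin{cases} e(-nx_r) & \text{if } |n| \le N,\\ \bigl(\tfrac{N+L-|n|}{L}\bigr)^{1/2} e(-nx_r) & \text{if } N < |n| \le N+L,\\ 0 & \text{if } |n| > N+L, \end{cases} \tag{2.96}$$

one gets

$$\|f\|^2 = \sum_{n=-N}^{N} |a_n|^2, \qquad (f, \varphi_r) = \sum_{n=-N}^{N} a_n e(nx_r), \quad r = 1,\dots,R. \tag{2.97}$$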

Bombieri proves that for each r, 1 < r < R 

R n 2 y 

Y u \(<Pr,<p s )\<2N + L + — .— , 



5 = 1 



(2.98) 



Now (2.95) follows from Theorem 0.1 or from (0.59), by taking



(2.99) 



OB :or||X|| :=\x-[x+-]\. 

(2.41), (2.44): cf. Montgomery and Vaughan [2].

(2.47), (2.48): cf. Hardy, Littlewood and Pólya, Inequalities (Cambridge), Theorem
288.



(2.51): cf. Mirsky, L., An Introduction to Linear Algebra (Oxford),
p. 388, Theorem 12.6.5.



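(2.65): We rewrite (2.65) in the more convenient form:

$$3\sin^2\theta \ge \theta^2(1 + 2\cos\theta) \quad \text{for } 0 < \theta \le \tfrac{\pi}{2}. \tag{2.100}$$

Now, from the series expansions of $\sin\theta$ and $\cos\theta$, we have

$$\sin\theta \ge \theta - \tfrac16\theta^3, \qquad \cos\theta \le 1 - \tfrac12\theta^2 + \tfrac1{24}\theta^4 \quad \text{for } 0 < \theta \le \tfrac{\pi}{2}, \tag{2.101}$$

since $(m!)^{-1}\theta^m \ge ((m+2)!)^{-1}\theta^{m+2}$, if $m \ge 1$ and $\theta^2 \le 6$. This gives
(2.100) on verifying

$$3\bigl(1 - \tfrac16\theta^2\bigr)^2 = 1 + 2\bigl(1 - \tfrac12\theta^2 + \tfrac1{24}\theta^4\bigr). \tag{2.102}$$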
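The elementary inequality (2.100) and the polynomial identity (2.102) are easy to check numerically. The following is only a sanity-check sketch, not part of the argument; the function names and the grid size are illustrative choices, and the range $0 < \theta \le \pi/2$ is assumed.

```python
import math


def gap_2_100(theta):
    """3 sin^2(theta) - theta^2 (1 + 2 cos(theta)); (2.100) claims this is >= 0."""
    return 3.0 * math.sin(theta) ** 2 - theta ** 2 * (1.0 + 2.0 * math.cos(theta))


def smallest_gap(steps=1000):
    """Smallest value of the gap on an equally spaced grid in (0, pi/2]."""
    return min(gap_2_100((math.pi / 2) * k / steps) for k in range(1, steps + 1))


def identity_gap_2_102(theta):
    """Difference of the two sides of (2.102); zero up to rounding."""
    left = 3.0 * (1.0 - theta ** 2 / 6.0) ** 2
    right = 1.0 + 2.0 * (1.0 - theta ** 2 / 2.0 + theta ** 4 / 24.0)
    return left - right


if __name__ == "__main__":
    print("min gap on grid:", smallest_gap())
    print("identity gap at theta = 1:", identity_gap_2_102(1.0))
```

Near $\theta = 0$ the gap behaves like $\theta^6/20$, so it is tiny but non-negative; the check therefore allows a small rounding tolerance.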

(2.76): In connection with this very effective form of the large sieve
we have in Montgomery and Vaughan [2] the remark that it may be that
the constant $\tfrac32$ in (2.77), and consequently also in (2.76), can be
replaced by 1. Actually, their work contains the constant $\sqrt{12+\sqrt{78}}/\pi$ (=
1.45282···) (and some slight improvements) instead of the aforementioned
constant $\tfrac32$ (cf. notes of Chapter 8 under (8.36)). For a previous
result of this type, see Montgomery [5] (Theorem 4.1). A first weighted
form of this sieve occurs in Davenport and Halberstam (cf. also Davenport
[1], Liu [1] and Montgomery [5]) (Theorem 2.4).
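As a small arithmetic check on the constant quoted in the note on (2.76): assuming the closed form is $\sqrt{12+\sqrt{78}}/\pi$ (the reading consistent with the quoted decimal 1.45282···), it can be evaluated directly and compared with $\tfrac32$. The function name below is illustrative.

```python
import math


def mv_constant():
    """Value of sqrt(12 + sqrt(78)) / pi (assumed closed form of the constant)."""
    return math.sqrt(12.0 + math.sqrt(78.0)) / math.pi


if __name__ == "__main__":
    c = mv_constant()
    print(f"{c:.6f}", c < 1.5)
```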
By comparing the results of this chapter with those of the previous
forms (0.54) given in Chapter 0, it is natural to ask the following question
(Erdos [3 1) Consider (in the notation of Ch. 

J] ^ |7\-)| 2 < cm Q)S, (2.103) 

p<Q 1=1 P 

where now only a 'negligible' proportion of terms remains on the left-
hand side. Are there then better results for $C_1(N,Q)$ than those for
$C(N, Q)$ (cf. (2.4)) or, more specifically, can one expect a gain of a
factor log Q here? Erdos |3) (cf. Erdos and Renyi [ 1 1) proved that if 
Q < this is true for almost all sets y. Wolke (cf. Wolke 0) 
has proved a slightly weaker estimate which holds also for (more general)
U(x) (cf. (2.9)), and for all sets, but under the severe condition
$N < Q(\log Q)^{\delta}$ for some $\delta > 0$. On the other hand Erdős [3] (cf. Erdős
and Renyi [ 1 ]) has shown that C\(N, Q) is of the same order of magni- 
tude as C(N, Q), if Q is of a higher order than ^JN log N. For further 
literature in this connection, see Elliott [5], [7]. The result is, generally
speaking, that (except under special circumstances) we cannot have a
better estimate in (2.103) than that for (2.4).

Another attempt at sharpening the large sieve is due to Burgess
(HI). He proved that for any set £ U N 



YjTj' ]U W «(Q\£\W + (Q\J2\)X ^ M 2 ), (2-104) 

q<Q 

qeJ2 



q<Q £=l % M<n<M+N 



from which a saving is made when the set in question is sparse. (Note also that here
U occurs to the first power on the left-hand side.)

In the other direction one may ask for general lower bounds, for
instance in (2.103). In view of (0.53) such results would mean that
'general sequences cannot be too well-distributed in almost all arithmetic
progressions'. The first result in this context is given by Roth [1].
For further literature concerning this question, we refer to Roth [3], [5],
Choi, Montgomery (Chapter 5) and Huxley. Wolke [10] (cf.
Wolke [9]) has stated the explicit lower bound (in the notation (1.29))



q<Q 1=1 H q<Q YKHJ 



2 

for any set J2 c 



M<n<M+N 
q&S q€g 

(2.105) 

However, (2.105) should rather be considered as a reduction of the
expression on the right-hand side to the large sieve (cf. (3.28)).



Chapter 3 



The Large Sieve for 
Character Sums 



THERE IS another version of the large sieve which is concerned with the
averaging of character sums (cf. (3.2) below). In this chapter we give
three such results which are readily obtained from Theorem 2.5. We
prove first

Theorem 3.1. Let $Q \in \mathbb{N}$. For any character $\chi$ mod $q$ and for any complex
numbers $a_n$, satisfying

$$a_n = 0, \text{ unless } (n,q) = 1 \;\; \forall q \le Q, \tag{3.1}$$

write

$$X(\chi) := \sum_{M<n\le M+N} a_n\chi(n). \tag{3.2}$$

Then, we have

$$\sum_{q\le Q} \frac{1}{\varphi(q)} \sum_{\chi \bmod q} |\tau(\bar\chi)|^2 |X(\chi)|^2 \le (N+Q^2) \sum_{M<n\le M+N} |a_n|^2, \tag{3.3}$$

$$\sum_{q\le Q} \bigl(N + \tfrac32 qQ\bigr)^{-1} \frac{1}{\varphi(q)} \sum_{\chi \bmod q} |\tau(\bar\chi)|^2 |X(\chi)|^2 \le \sum_{M<n\le M+N} |a_n|^2, \tag{3.4}$$

and

$$\sum_{q\le Q} \log\frac{Q}{q} \sum_{\chi \bmod q}^{*} |X(\chi)|^2 \le (N + Q^2) \sum_{M<n\le M+N} |a_n|^2, \tag{3.5}$$

where $\tau(\chi)$ is defined by (1.60) and

$$\sum_{\chi \bmod q}^{*} = \sum_{\substack{\chi \bmod q\\ \chi \text{ primitive}}}. \tag{3.6}$$
Remark. The condition (3.1) is not usually a severe restriction, since
in applications either this is fulfilled or the extra terms arising in the other
case are separately estimated to be small.

Proof. First of all, for (n, q) = 1, it follows from (1.61) that

$$\tau(\bar\chi)\,a_n\chi(n) = \sum_{\ell=1}^{q} \bar\chi(\ell)\,a_n\,e\Bigl(n\frac{\ell}{q}\Bigr), \tag{3.7}$$

which holds, by (3.1), also for (n, q) > 1. Now (3.7) gives

$$\tau(\bar\chi)X(\chi) = \sum_{\ell=1}^{q} \bar\chi(\ell)\,U\Bigl(\frac{\ell}{q}\Bigr), \tag{3.8}$$

where U(x) is defined through (2.9). Multiplying each side of (3.8) by
its complex conjugate, summing over all characters $\chi$ mod $q$ and using
(1.57) with $U_\ell = U(\ell/q)$, we get the identity

$$\frac{1}{\varphi(q)} \sum_{\chi \bmod q} |\tau(\bar\chi)|^2 |X(\chi)|^2 = \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Bigl|U\Bigl(\frac{\ell}{q}\Bigr)\Bigr|^2, \tag{3.9}$$

(cf. Bombieri and Davenport [2]) and further use of this in Theorem 2.5
yields (3.3) and (3.4).
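The identity (3.9) can be tested numerically. The sketch below is an illustration only and makes simplifying assumptions that are not in the text: the modulus is an odd prime p (so that all characters can be written down from a primitive root), U(x) is taken as $\sum_{M<n\le M+N} a_n e(nx)$, and condition (3.1) is enforced by zeroing the coefficients at multiples of p. All function names are hypothetical.

```python
import cmath
import math


def e(x):
    """e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)


def primitive_root(p):
    """Smallest primitive root modulo an odd prime p (brute force)."""
    for g in range(2, p):
        powers, y = set(), 1
        for _ in range(p - 1):
            y = y * g % p
            powers.add(y)
        if len(powers) == p - 1:
            return g
    raise ValueError("p must be an odd prime")


def characters(p):
    """All Dirichlet characters modulo an odd prime p, as Python functions."""
    g = primitive_root(p)
    index, y = {}, 1
    for j in range(p - 1):
        index[y] = j              # y = g**j mod p
        y = y * g % p

    def make(k):
        def chi(n):
            if n % p == 0:
                return 0j
            return e(k * index[n % p] / (p - 1))
        return chi

    return [make(k) for k in range(p - 1)]


def both_sides_3_9(p, M, coeffs):
    """Return (left, right) of (3.9) for modulus p and coefficients a_{M+1}, a_{M+2}, ..."""
    ns = [M + 1 + i for i in range(len(coeffs))]
    a = [0j if n % p == 0 else complex(c) for n, c in zip(ns, coeffs)]  # enforce (3.1)

    def U(x):
        return sum(c * e(n * x) for n, c in zip(ns, a))

    left = 0.0
    for chi in characters(p):
        tau_bar = sum(chi(l).conjugate() * e(l / p) for l in range(1, p + 1))
        X = sum(c * chi(n) for n, c in zip(ns, a))
        left += abs(tau_bar) ** 2 * abs(X) ** 2
    left /= (p - 1)               # phi(p) = p - 1
    right = sum(abs(U(l / p)) ** 2 for l in range(1, p))
    return left, right


if __name__ == "__main__":
    print(both_sides_3_9(7, 3, [1, 2 - 1j, 0.5, -3, 1j, 4, 2, 1, -1, 0.25]))
```

The two returned numbers should agree up to rounding; this reflects that (3.9) is an exact identity, so any loss in (3.3) and (3.4) comes only from Theorem 2.5.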

Next, for a character $\chi$ mod $q$ let $f$ be its conductor, and let $\chi$ be
induced by $\chi^*$ mod $f$. Then by the assumption (3.1) (cf. (1.59)),

$$X(\chi^*) = X(\chi), \tag{3.10}$$

and using Lemma 1.1 as well as

$$\frac{f}{\varphi(f)} \sum_{\substack{r\le x\\ (r,f)=1}} \frac{\mu^2(r)}{\varphi(r)} \ge \log x \quad \text{for } x > 0, \tag{3.11}$$

we obtain, in our notation (3.6),

$$\sum_{q\le Q} \frac{1}{\varphi(q)} \sum_{\chi \bmod q} |\tau(\bar\chi)|^2 |X(\chi)|^2 \ge \sum_{f\le Q} \log\frac{Q}{f} \sum_{\chi \bmod f}^{*} |X(\chi)|^2. \tag{3.12}$$

Thus (3.5) is a consequence of (3.3).
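The lower bound (3.11) is the arithmetic input that turns the weight $|\tau|^2/\varphi(q)$ into $\log(Q/f)$. A brute-force check is sketched below, reading (3.11) as $\frac{f}{\varphi(f)}\sum_{r\le x,\,(r,f)=1}\mu^2(r)/\varphi(r) \ge \log x$; the helper names are illustrative and the factorization is by trial division, so it is only meant for small arguments.

```python
import math


def factorize(n):
    """Prime factorization of n >= 1 as a dict {prime: exponent} (trial division)."""
    factors, d = {}, 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def phi(n):
    """Euler's totient function."""
    result = n
    for p in factorize(n):
        result = result // p * (p - 1)
    return result


def mu(n):
    """Moebius function."""
    factors = factorize(n)
    if any(k > 1 for k in factors.values()):
        return 0
    return -1 if len(factors) % 2 else 1


def sides_3_11(f, x):
    """Return (left, right) of (3.11) for conductor f and real x >= 1."""
    total = sum(mu(r) ** 2 / phi(r)
                for r in range(1, int(x) + 1) if math.gcd(r, f) == 1)
    return f / phi(f) * total, math.log(x)


if __name__ == "__main__":
    for f, x in [(1, 10), (6, 10), (30, 100)]:
        print(f, x, sides_3_11(f, x))
```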

Regarding the quality of the results in Theorem 3.1 we note that,
in view of the identity (3.9), the estimates (3.3) and (3.4) are capable
of improvement only along with a sharpening of Theorem 2.5. On the
other hand, the statement (3.5) leaves a gap (even though the inequality
(3.11) is capable of an asymptotic formulation). □

In the case when only primitive characters occur in both (3.3) and
(3.4), our condition (3.1) can be removed to prove the next

Theorem 3.2. For any character $\chi$ mod $f$, $r \in \mathbb{N}$ and for arbitrary complex
numbers $a_n$, set

$$X_r(\chi) := \sum_{M<n\le M+N} a_n\chi(n)c_r(n), \tag{3.13}$$

where $c_r(n)$ is given by (1.29). Then, we have

$$\sum_{\substack{rf\le Q\\ (r,f)=1}} \frac{f}{\varphi(rf)} \sum_{\chi \bmod f}^{*} |X_r(\chi)|^2 \le (N + Q^2) \sum_{M<n\le M+N} |a_n|^2, \tag{3.14}$$

and

$$\sum_{\substack{rf\le Q\\ (r,f)=1}} \bigl(N + \tfrac32 rfQ\bigr)^{-1} \frac{f}{\varphi(rf)} \sum_{\chi \bmod f}^{*} |X_r(\chi)|^2 \le \sum_{M<n\le M+N} |a_n|^2. \tag{3.15}$$



Remark. Observe that, under the condition (3.1), we have $c_r(n)$ appearing
in the non-zero terms of (3.13) as $\mu(r)$ and so, (3.14) also leads to
(3.5) in view of (3.12).



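Proof. We have for any $q \in \mathbb{N}$, by (1.57),

$$\frac{1}{\varphi(q)} \sum_{\chi \bmod q} \Bigl| \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \bar\chi(\ell)\,U\Bigl(\frac{\ell}{q}\Bigr) \Bigr|^2 = \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Bigl|U\Bigl(\frac{\ell}{q}\Bigr)\Bigr|^2. \tag{3.16}$$

Now if $\chi$ mod $q$ is induced by $\chi^*$ mod $f$ ($f$ the conductor of $\chi$), we have,
on using (1.68) and (1.59),

$$q = rf, \qquad \chi(\ell) = \chi^*(\ell) \quad \text{for } (\ell,q) = 1. \tag{3.17}$$

Therefore, summing (3.16) over $q \le Q$ gives

$$\sum_{\substack{rf\le Q\\ (r,f)=1}} \frac{1}{\varphi(rf)} \sum_{\chi \bmod f}^{*} \Bigl| \sum_{\substack{\ell=1\\ (\ell,rf)=1}}^{rf} \bar\chi(\ell)\,U\Bigl(\frac{\ell}{rf}\Bigr) \Bigr|^2 \le \sum_{q\le Q} \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Bigl|U\Bigl(\frac{\ell}{q}\Bigr)\Bigr|^2. \tag{3.18}$$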



For any primitive character $\chi$ mod $f$, $q = rf$, $(r, f) = 1$ and any $\ell$,
$(\ell,q) = 1$, on writing

$$\ell = \lambda r + \mu f, \quad (\lambda,f) = 1, \; (\mu,r) = 1, \tag{3.19}$$

we will have

$$\sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \bar\chi(\ell)\,U\Bigl(\frac{\ell}{q}\Bigr) = \bar\chi(r) \sum_{M<n\le M+N} a_n \sum_{\substack{\lambda=1\\ (\lambda,f)=1}}^{f} \bar\chi(\lambda)\,e\Bigl(n\frac{\lambda}{f}\Bigr) \sum_{\substack{\mu=1\\ (\mu,r)=1}}^{r} e\Bigl(n\frac{\mu}{r}\Bigr) = \bar\chi(r)\,\tau(\bar\chi)\,X_r(\chi), \tag{3.20}$$

because of (1.62) and (1.29). Since $(r,f) = 1$, we have (by (1.42)) that
$|\chi(r)| = 1$ and further, by Lemma 1.1, that $|\tau(\bar\chi)|^2 = f$. Thus (3.20) and
(3.18), on using (2.86), prove the part (3.14). The proof of (3.15) is the
same, except that before summing over $q$ in (3.16) we need to multiply by the
factor $(N + \tfrac32 qQ)^{-1}$ and at the end employ (2.87) instead of (2.86). □

Since obviously

$$c_1(n) = 1, \tag{3.21}$$

we obtain, by retaining only the parts with r = 1 in the expressions
occurring on the left-hand sides of Theorem 3.2, as a particular case

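Theorem 3.3. For any character $\chi$ mod $q$ and for any complex numbers
$a_n$, define

$$X(\chi) := \sum_{M<n\le M+N} a_n\chi(n). \tag{3.22}$$

Then, we have

$$\sum_{q\le Q} \frac{q}{\varphi(q)} \sum_{\chi \bmod q}^{*} |X(\chi)|^2 \le (N + Q^2) \sum_{M<n\le M+N} |a_n|^2 \tag{3.23}$$

and

$$\sum_{q\le Q} \bigl(N + \tfrac32 qQ\bigr)^{-1} \frac{q}{\varphi(q)} \sum_{\chi \bmod q}^{*} |X(\chi)|^2 \le \sum_{M<n\le M+N} |a_n|^2. \tag{3.24}$$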



NOTES

The version of the large sieve discussed in this chapter occurs for
the first time in Bombieri [1] (see, however, Rényi; cf. (0.32)). Simplifications
of the proof and improvement of the quality of the result
were made by Davenport and Halberstam [1], who also obtained there
the first weighted form. However, as has been mentioned earlier and can
also be seen from (3.9) and (3.18), the results of this chapter are more
or less direct consequences of those in Chapter 2.

A first result with condition (3.1) was given by Bombieri and Davenport
(and for (3.5), see Bombieri [6] (Théorème 8)) where they
also prove a generalization of (3.3) which, via Theorem 2.5, under the
assumptions of Theorem 3.1, becomes

Z Z |TW|2 ' Z ^X(n)\ 2 <(j + 1 + Q 2 ) 2 \a n \ 2 

q<Q ^ H x moA <l M<n<M+N M<n<M+N 

(q,k)=\ n=t{moAk) 

(3.25) 
where $k \in \mathbb{N}$ and $(\ell,k) = 1$.

(3.11): By Chapter 1, we have that, for x > 0,

. (n/)=l 

(3.26) 

on using, for $x \ge 1$, if $N \le x < N+1$,

$$\sum_{n\le x} \frac1n = \sum_{n\le N} \frac1n \ge \log(N+1) > \log x \tag{3.27}$$

(cf. van Lint and Richert [1]).
Clearly, analogous to the derivation of (3.5) from (3.3), one can obtain
a corresponding result from (3.4). In the special case $a_n = \Lambda(n)$,
this has been done by Montgomery and Vaughan [2] (cf. (6.29)).

The extension to sums involving Ramanujan's sum $c_r(n)$, namely,
Theorem 3.2, which contains Theorem 3.3, is due to A. Selberg ([6], cf.
Bombieri [6] (Théorème 7A)).

We obtained (3.23) from (3.14) by keeping only the part with r = 1.
On the other hand, by taking the part corresponding to f = 1, we get

$$\sum_{q\le Q} \frac{1}{\varphi(q)} \Bigl| \sum_{M<n\le M+N} a_n c_q(n) \Bigr|^2 \le \sum_{q\le Q} \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Bigl|U\Bigl(\frac{\ell}{q}\Bigr)\Bigr|^2, \tag{3.28}$$

which should be compared with Wolke's result (2.105).

The main importance of Selberg's generalization (3.14) is due to
its application in proving density theorems for Dirichlet's L-functions
(cf. Chapter 6, 2.). There the strongest known results (cf. Montgomery
[8], Motohashi [11] and Jutila [12]) are based on the following striking
identity, again due to A. Selberg:

L{s,x)M{s,x,^r) = ZZi l?Mn)Zd\n{d,(X -=Xmodf) 
for any set of & e C,& = 0(1). 

(3.29) 
where

X(n)^,-(n) i-r x(p) 



n=\ n p\J^ P 

$$\psi_r(n) := \mu((r,n))\,\varphi((r,n)), \tag{3.31}$$

and

$$\mu(r) \ne 0. \tag{3.32}$$

Actually, (3.14) is employed in the weaker form

$$\sum_{\substack{rf\le Q\\ (r,f)=1}} \frac{\mu^2(r)\,f}{\varphi(rf)} \sum_{\chi \bmod f}^{*} \Bigl| \sum_{M<n\le M+N} a_n\chi(n)\psi_r(n) \Bigr|^2 \le (N + Q^2) \sum_{M<n\le M+N} |a_n|^2. \tag{3.33}$$

In fact, the expression on the left-hand side of (3.33) is the part of
the sum in (3.14) corresponding to squarefree r's, since (under (3.32))
one has readily, by (1.32),

$$c_r(n) = \sum_{d\mid(r,n)} \mu\Bigl(\frac{r}{d}\Bigr)d = \mu\Bigl(\frac{r}{(r,n)}\Bigr)\varphi((r,n)) = \mu(r)\psi_r(n) \quad (\mu(r) \ne 0). \tag{3.34}$$
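The evaluation (3.34) of Ramanujan's sum for squarefree r can be confirmed by brute force. The sketch below compares three quantities: the defining exponential sum over reduced residues, the divisor sum $\sum_{d\mid(r,n)}\mu(r/d)d$, and $\mu(r)\psi_r(n)$ with $\psi_r(n) = \mu((r,n))\varphi((r,n))$. Function names are illustrative, and the last comparison is made only for squarefree r, as in (3.32).

```python
import cmath
import math


def factorize(n):
    """Prime factorization of n >= 1 as a dict {prime: exponent} (trial division)."""
    factors, d = {}, 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def phi(n):
    """Euler's totient function."""
    result = n
    for p in factorize(n):
        result = result // p * (p - 1)
    return result


def mu(n):
    """Moebius function."""
    factors = factorize(n)
    if any(k > 1 for k in factors.values()):
        return 0
    return -1 if len(factors) % 2 else 1


def c_direct(r, n):
    """Ramanujan's sum from its definition: sum of e(n*l/r) over (l, r) = 1."""
    total = sum(cmath.exp(2j * math.pi * n * l / r)
                for l in range(1, r + 1) if math.gcd(l, r) == 1)
    return round(total.real)


def c_divisor(r, n):
    """Ramanujan's sum as the divisor sum over d | (r, n) of mu(r/d) * d."""
    g = math.gcd(r, n)
    return sum(mu(r // d) * d for d in range(1, g + 1) if g % d == 0)


def psi(r, n):
    """psi_r(n) = mu((r, n)) * phi((r, n)), as in (3.31)."""
    g = math.gcd(r, n)
    return mu(g) * phi(g)


if __name__ == "__main__":
    print(c_direct(6, 3), c_divisor(6, 3), mu(6) * psi(6, 3))
```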

Motohashi [14] has in turn shown that (3.33) can be generalized to
give the following estimate: Let $\chi_j$ mod $f_j$, $f_j \le F$ $(j = 1,\dots,J)$, be
distinct primitive characters. Then

Z Z ^77^1 Z "nXMWrint <(N + 0(JFR 2 log(FR))) £ l a "l 2 

r<K J<J ™ M<n<M+N M<n<M+n 

(fj,r)=l 

(3.35) 

which, for sufficiently small J, improves upon (3.33).

Slightly more general forms than those of Theorems 3.1 and 3.3
(namely, without the restriction (3.1) or removing the limitation to summing
over only primitive characters) are possible but at the expense of
the quality of the estimates (cf. Bombieri [1] and Davenport and Halberstam
[1]).

For an estimate of averages involving real characters only, see Jutila.

Finally, there are results concerning averages of character sums,
which are useful when combined with large sieve estimates in some
applications. They can be obtained without employing results of Chapter
2 (cf. Montgomery [5] (Theorems 6.2 and 6.3)). We mention, as an
example, (Montgomery [5] (Theorem 6.2)):



$$\sum_{\chi \bmod q} |X(\chi)|^2 \le \varphi(q)\Bigl(1 + \Bigl[\frac{N-1}{q}\Bigr]\Bigr) \sum_{\substack{M<n\le M+N\\ (n,q)=1}} |a_n|^2. \tag{3.36}$$

For a proof, split $X(\chi)$ into $1 + \bigl[\frac{N-1}{q}\bigr]$ parts of length $q$ (introducing
additional $a_n$'s = 0, if necessary). For each part $X_i(\chi)$, say, it results from
(1.57), with obvious appropriate choices for $U_\ell$, that

$$\sum_{\chi \bmod q} |X_i(\chi)|^2 = \varphi(q) \sum_{\substack{n\in\mathcal{N}_i\\ (n,q)=1}} |a_n|^2, \tag{3.37}$$

where $\mathcal{N}_i$ denotes the range of $n$ in $X_i(\chi)$. Now (3.36) follows on using
Minkowski's and Cauchy's inequalities. Likewise, but with a more
complicated yet still elementary reasoning (Montgomery [5] (Theorem
6.3)), one obtains



2 \X(x*)\ 2 < q{\ + 

X^aodq 



N -I 



q 



\a n \ 



(3.38) 



M<n<M+N 
(n,q)=l 



where for each $\chi$ mod $q$, $\chi^*$ denotes the primitive character which induces
$\chi$.



Chapter 4 

The Large Sieve for Dirichlet 
Polynomials and Dirichlet 

Series 



STILL ANOTHER form of the large sieve, as has been noted by Davenport,
can be obtained with respect to Dirichlet polynomials

$$\sum_{n=1}^{N} a_n n^{-s}, \quad s = \sigma + it \ (\sigma, t \text{ real}), \ a_n \in \mathbb{C}, \tag{4.1}$$

as an application of Lemma 2.1.

Theorem 4.1. Let $T_0$, $T (> 0)$ and $t_r$ be real numbers satisfying

$$T_0 = t_0 < t_1 < \cdots < t_R < t_{R+1} = T_0 + T, \tag{4.2}$$

and put

$$\delta := \min_{0\le r\le R} (t_{r+1} - t_r). \tag{4.3}$$

Then, we have

$$\sum_{r=1}^{R} \Bigl| \sum_{n=1}^{N} a_n n^{-it_r} \Bigr|^2 \le (T + 4N\log N)(\log N + \delta^{-1}) \sum_{n=1}^{N} |a_n|^2, \quad \forall a_n \in \mathbb{C}. \tag{4.4}$$



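Proof. By (4.2) and (4.3), we have $(R + 1)\delta \le T$ and so we can suppose
that $N \ge 2$. Also, by (4.3), the intervals $\bigl(t_r - \tfrac{\delta}{2},\, t_r + \tfrac{\delta}{2}\bigr)$, $1 \le r \le R$, do
not overlap. Therefore, taking in Lemma 2.1

$$f(t) := \sum_{n=1}^{N} a_n n^{-it} \quad \text{with the points } t_r \ (1 \le r \le R) \tag{4.5}$$

and proceeding as in (2.28), we obtain

$$\sum_{r=1}^{R} |f(t_r)|^2 \le \Bigl( \int_{T_0}^{T_0+T} |f(t)|^2\,dt \Bigr)^{1/2} \Bigl( \int_{T_0}^{T_0+T} |f'(t)|^2\,dt \Bigr)^{1/2} + \delta^{-1} \int_{T_0}^{T_0+T} |f(t)|^2\,dt. \tag{4.6}$$

Now

$$\int_{T_0}^{T_0+T} |f(t)|^2\,dt = T\sum_{n=1}^{N} |a_n|^2 + \sum_{\substack{m,n=1\\ m\ne n}}^{N} a_m\overline{a_n} \int_{T_0}^{T_0+T} \Bigl(\frac{n}{m}\Bigr)^{it} dt. \tag{4.7}$$

The second sum, on noting that

$$\Bigl| \log\frac{n}{m} \Bigr| \ge \frac{|n - m|}{\max(n, m)} \tag{4.8}$$

and using the arithmetic-geometric inequality, may be estimated by

$$2 \sum_{\substack{m,n=1\\ m\ne n}}^{N} \frac{|a_m a_n|}{|\log\frac{n}{m}|} \le N \sum_{\substack{m,n=1\\ m\ne n}}^{N} \frac{1}{|n - m|} \bigl( |a_m|^2 + |a_n|^2 \bigr). \tag{4.9}$$

Due to symmetry in m and n (of these expressions), it is enough to
consider the factor of $|a_n|^2$ on the right-hand side. Apart from the factor N, this equals

$$\sum_{1\le m<n} \frac{1}{n-m} + \sum_{n<m\le N} \frac{1}{m-n} = \sum_{1\le k<n} \frac1k + \sum_{1\le k\le N-n} \frac1k \le 2\log N. \tag{4.10}$$

Therefore, the expression (4.9) can further be estimated by

$$4N\log N \sum_{n=1}^{N} |a_n|^2. \tag{4.11}$$

Using this in (4.7), we get

$$\int_{T_0}^{T_0+T} |f(t)|^2\,dt \le (T + 4N\log N) \sum_{n=1}^{N} |a_n|^2. \tag{4.12}$$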



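Clearly, for $f'$ instead of $f$, (4.12) holds with an extra factor of $\log^2 N$
on the right-hand side; in other words, we have

$$\int_{T_0}^{T_0+T} |f'(t)|^2\,dt \le (T + 4N\log N)\log^2 N \sum_{n=1}^{N} |a_n|^2. \tag{4.13}$$

Now (4.4) follows from (4.6), (4.12) and (4.13). □

For applications in the direction of the classical mean-value theorems
for Dirichlet series, the powerful tools of Montgomery and
Vaughan [1], namely (2.49) and (2.77), can be remodelled to the following
Lemma.

Lemma 4.1. Let $\lambda_1, \dots, \lambda_R$ be distinct real numbers and set

$$\Delta := \min_{r\ne s} |\lambda_r - \lambda_s| \tag{4.14}$$

and

$$\Delta_r := \min_{s\ne r} |\lambda_r - \lambda_s|. \tag{4.15}$$

Then, for all $u_r \in \mathbb{C}$,

$$\Bigl| \sum_{\substack{r,s=1\\ r\ne s}}^{R} \frac{u_r\overline{u_s}}{\lambda_r - \lambda_s} \Bigr| \le \pi\Delta^{-1} \sum_{r=1}^{R} |u_r|^2 \tag{4.16}$$

and

$$\Bigl| \sum_{r=1}^{R} \sum_{\substack{s=1\\ s\ne r}}^{R} \frac{u_r\overline{u_s}}{\lambda_r - \lambda_s} \Bigr| \le \frac32\pi \sum_{r=1}^{R} |u_r|^2\Delta_r^{-1}. \tag{4.17}$$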



Proof. Let $\varepsilon > 0$ denote a small number to be suitably restricted below.
Put

$$x_r = \varepsilon\lambda_r \quad (1 \le r \le R). \tag{4.18}$$

Then, we have in the notation of (2.10) and (4.14), for all sufficiently
small $\varepsilon$,

$$\delta = \min_{r\ne s} \|\varepsilon(\lambda_r - \lambda_s)\| = \varepsilon \min_{r\ne s} |\lambda_r - \lambda_s| = \varepsilon\Delta, \tag{4.19}$$

and, similarly in the notation of (2.75) and (4.15),

$$\delta_r = \varepsilon\Delta_r \quad (1 \le r \le R). \tag{4.20}$$

Multiplying both sides of (2.49) and (2.77) by $\pi\varepsilon$ and further using
(4.18), (4.19) and (4.20), we obtain Lemma 4.1 as a consequence of

$$\lim_{\varepsilon\to+0} \frac{\pi\varepsilon}{\sin(\pi(x_r - x_s))} = \frac{1}{\lambda_r - \lambda_s}. \tag{4.21}$$

□

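We obtain almost immediately from Lemma 4.1 the following

Theorem 4.2. Under the assumptions and notation of Lemma 4.1 we
have

$$\int_{-T}^{T} \Bigl| \sum_{r=1}^{R} a_r e^{i\lambda_r t} \Bigr|^2 dt = 2(T + \theta_1\pi\Delta^{-1}) \sum_{r=1}^{R} |a_r|^2, \quad \forall a_r \in \mathbb{C}, \tag{4.22}$$

and

$$\int_{-T}^{T} \Bigl| \sum_{r=1}^{R} a_r e^{i\lambda_r t} \Bigr|^2 dt = \sum_{r=1}^{R} |a_r|^2 (2T + 3\theta_2\pi\Delta_r^{-1}), \quad \forall a_r \in \mathbb{C}, \tag{4.23}$$

where

$$|\theta_j| \le 1, \quad j = 1, 2. \tag{4.24}$$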
Proof. The integral of the theorem is

$$\sum_{r,s=1}^{R} a_r\overline{a_s} \int_{-T}^{T} e^{i(\lambda_r - \lambda_s)t}\,dt = 2T\sum_{r=1}^{R} |a_r|^2 + \sum_{r=1}^{R} \sum_{\substack{s=1\\ s\ne r}}^{R} a_r\overline{a_s}\, \frac{e^{i(\lambda_r - \lambda_s)T} - e^{-i(\lambda_r - \lambda_s)T}}{i(\lambda_r - \lambda_s)}. \tag{4.25}$$

Now application of (4.16) and (4.17) with the choices

$$u_r = a_r e^{i\lambda_r T} \quad \text{and} \quad u_r = a_r e^{-i\lambda_r T} \tag{4.26}$$

to the double sum in (4.25) yields (4.22) and (4.23) respectively.
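Since (4.25) expresses the integral in closed form, the quantities $\theta_1$ and $\theta_2$ of Theorem 4.2 can be computed for any concrete data and checked against $|\theta_j| \le 1$. The following sketch does this, reading (4.22) and (4.23) as $2(T + \theta_1\pi\Delta^{-1})\sum|a_r|^2$ and $\sum|a_r|^2(2T + 3\theta_2\pi\Delta_r^{-1})$; the function names and the sample data (frequencies $\lambda_r = -\log r$) are illustrative assumptions.

```python
import math


def integral_4_25(a, lam, T):
    """Exact value of the integral over [-T, T] of |sum a_r exp(i lam_r t)|^2, via (4.25)."""
    total = 2.0 * T * sum(abs(x) ** 2 for x in a)
    for r in range(len(a)):
        for s in range(len(a)):
            if r != s:
                d = lam[r] - lam[s]
                # (e^{idT} - e^{-idT}) / (i d) = 2 sin(dT) / d; the imaginary parts
                # of the (r, s) and (s, r) terms cancel, so only real parts are kept.
                total += (a[r] * complex(a[s]).conjugate()).real * 2.0 * math.sin(d * T) / d
    return total


def theta_values(a, lam, T):
    """Return (theta_1, theta_2) defined by (4.22) and (4.23)."""
    R = len(a)
    norm = sum(abs(x) ** 2 for x in a)
    excess = integral_4_25(a, lam, T) - 2.0 * T * norm
    delta = min(abs(lam[r] - lam[s]) for r in range(R) for s in range(r))
    weighted = sum(
        abs(a[r]) ** 2 / min(abs(lam[r] - lam[s]) for s in range(R) if s != r)
        for r in range(R)
    )
    theta1 = excess / (2.0 * math.pi * norm / delta)
    theta2 = excess / (3.0 * math.pi * weighted)
    return theta1, theta2


if __name__ == "__main__":
    lam = [-math.log(r) for r in range(1, 9)]
    a = [1, -2 + 1j, 0.5, 3j, 1, -1, 2, 0.25]
    print(theta_values(a, lam, 5.0))
```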

The most interesting use of Theorem 4.2 is when applied, in its form
(4.23), to Dirichlet series via Dirichlet polynomials. Taking

$$\lambda_r = -\log r, \tag{4.27}$$

we find, by (4.8),

$$\Delta_r^{-1} \le r + 1, \tag{4.28}$$

and so get, by (4.23),

$$\int_{-T}^{T} \Bigl| \sum_{r=1}^{R} a_r r^{-it} \Bigr|^2 dt = \sum_{r=1}^{R} |a_r|^2 (2T + 3\theta_3\pi(r+1)), \quad |\theta_3| \le 1, \ \forall R \in \mathbb{N}. \tag{4.29}$$

Further, if we impose the condition

$$\sum_{r=1}^{\infty} r|a_r|^2 < \infty, \tag{4.30}$$

we can conclude from (4.29) that the Dirichlet polynomials

$$\sum_{r=1}^{R} a_r r^{-it} \tag{4.31}$$

converge in the mean to the Dirichlet series

$$\sum_{r=1}^{\infty} a_r r^{-it} \in L^2(-T, T). \tag{4.32}$$



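Thus we derive from (4.23) the following important result.

Theorem 4.3. For $a_n \in \mathbb{C}$, suppose that

$$\sum_{n=1}^{\infty} n|a_n|^2 < \infty. \tag{4.33}$$

Then

$$\int_{-T}^{T} \Bigl| \sum_{n=1}^{\infty} a_n n^{-it} \Bigr|^2 dt = \sum_{n=1}^{\infty} |a_n|^2 (2T + 3\theta\pi(n+1)), \tag{4.34}$$

where

$$|\theta| \le 1. \tag{4.35}$$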



NOTES 

Theorem 4.1, due to Davenport, was published by Montgomery [2].

Lemma 4.1 and Theorems 4.2 and 4.3 are due to Montgomery and
Vaughan [1] (Theorem 2 and Corollaries 2 and 3).

(4.1): Clearly, it is no restriction to have the results of this chapter
with $\sigma = 0$.

Theorems 4.2 and 4.3: Obviously, it is possible to obtain,
with the help of Gallagher's Lemma 2.1 (cf. (4.6)), results corresponding
to Theorems 4.2 and 4.3 in which the integral is replaced by a sum
over a set of well-spaced points.

(4.32): For the reasoning leading to (4.32), cf. Titchmarsh, E.C.,
The Theory of Functions (Oxford), pp. 386-387.

For a discussion of p(N, T) in the general inequality 

T r N N 

I |J]a n «- ,Y | 2 dt<p{N,T)Y,\an\ 2 , (4.36) 
*J 1 — 1 



n=\ 



n=l 



see Elliott [7]. 



Chapter 5 

The Hybrid Sieve 



FOR SOME very important applications to number theory, via Dirichlet
series, estimations of averages of the type

$$\sum_{q\le Q} \langle\cdots\rangle \sum_{\chi \bmod q} \int_{-T}^{T} \tag{5.1}$$

which is a combined version of the forms of the large sieve considered
in Chapters 3 and 4, are of much use. Prior to the innovation of an
ingenious method of Halász [1] there was no method of dealing with
this question without carrying out at least one of the operations $\sum$ or
$\int$ in a trivial fashion. Methods, for the purpose of this hybrid sieve,
were developed independently by Montgomery, combining ideas of
Halász with the large sieve, and by Jutila [1], who used a method of
Rodosskij with the large sieve. Subsequently, a common basis for both of
these was provided by Gallagher [4] through the introduction of new
technical devices (see Lemmas 5.1 and 5.2 below).
We start with

Lemma 5.1. Let

$$D(t) := \sum_{\nu} c(\nu)e(\nu t), \tag{5.2}$$

where $\nu$ runs through a countable set of real numbers and the coefficients
$c(\nu) (\in \mathbb{C})$ are subjected to the condition

$$\sum_{\nu} |c(\nu)| < \infty. \tag{5.3}$$

Let $\delta$ and $T$ be positive real numbers satisfying

$$\delta T \le \frac{1}{2\pi}. \tag{5.4}$$

Then, for some absolute constant $c_0$,

$$\int_{-T}^{T} |D(t)|^2\,dt \le c_0 \int_{-\infty}^{\infty} |C_\delta(y)|^2\,dy \tag{5.5}$$

holds, where

$$C_\delta(y) := \delta^{-1} \sum_{|\nu - y| < \delta/2} c(\nu). \tag{5.6}$$

Proof. For a proof of (5.5) we use two results from the theory of Fourier
transforms. Introduce

$$F_\delta(y) := \begin{cases} \delta^{-1} & \text{if } |y| < \delta/2,\\ 0 & \text{otherwise,} \end{cases} \quad (y \in \mathbb{R}) \tag{5.7}$$

so that

$$C_\delta(y) = \sum_{\nu} c(\nu)F_\delta(y - \nu). \tag{5.8}$$

In view of (5.3), $C_\delta(y)$ is a bounded integrable function and hence
it belongs to $L^2(-\infty, \infty)$. Therefore, by Plancherel's theorem, $C_\delta$ has a
Fourier transform $\hat{C}_\delta$ and further, by Parseval's formula, we have

$$\int_{-\infty}^{\infty} |C_\delta(y)|^2\,dy = \int_{-\infty}^{\infty} |\hat{C}_\delta(t)|^2\,dt. \tag{5.9}$$

Now one has

$$\hat{C}_\delta(t) = \int_{-\infty}^{\infty} C_\delta(y)e(yt)\,dy = \sum_{\nu} c(\nu) \int_{-\infty}^{\infty} F_\delta(y - \nu)e(yt)\,dy = \sum_{\nu} c(\nu)e(\nu t) \int_{-\infty}^{\infty} F_\delta(x)e(xt)\,dx = D(t)\hat{F}_\delta(t), \tag{5.10}$$

say, on using (5.8) and (5.2). Also, by (5.7),

$$\hat{F}_\delta(t) = \delta^{-1} \int_{-\delta/2}^{\delta/2} e(xt)\,dx = \frac{\sin(\pi\delta t)}{\pi\delta t}. \tag{5.11}$$

Thus (5.9) yields

$$\int_{-\infty}^{\infty} |C_\delta(y)|^2\,dy = \int_{-\infty}^{\infty} |D(t)\hat{F}_\delta(t)|^2\,dt \ge \int_{-T}^{T} |D(t)\hat{F}_\delta(t)|^2\,dt, \tag{5.12}$$

because of (5.10). For $|t| \le T$, we use (5.11) to note (cf. (5.4))

$$\hat{F}_\delta(t) \ge \frac{\sin(\pi\delta T)}{\pi\delta T} \ge 2\sin\tfrac12 =: c_0^{-1/2}, \tag{5.13}$$

say, so that (5.5) follows from (5.12).

□
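Two ingredients of the preceding proof are easy to verify numerically: the closed form (5.11) for the transform of the box function $F_\delta$, and the lower bound $2\sin\tfrac12$ in (5.13), which gives the admissible value $c_0 = (2\sin\tfrac12)^{-2}$ when $\delta T \le 1/(2\pi)$. The sketch below is illustrative only; the midpoint-rule quadrature and the function names are assumptions of this example.

```python
import math


def f_hat_closed(delta, t):
    """Closed form sin(pi*delta*t) / (pi*delta*t) from (5.11), with value 1 at t = 0."""
    u = math.pi * delta * t
    return 1.0 if u == 0.0 else math.sin(u) / u


def f_hat_quadrature(delta, t, steps=2000):
    """Midpoint rule for (1/delta) * integral of cos(2*pi*x*t) over [-delta/2, delta/2].

    The sine part of e(xt) integrates to zero by symmetry, so only the cosine is kept.
    """
    h = delta / steps
    total = 0.0
    for k in range(steps):
        x = -delta / 2.0 + (k + 0.5) * h
        total += math.cos(2.0 * math.pi * x * t)
    return total * h / delta


def c0():
    """The value (2 sin(1/2))^(-2) permitted by (5.13)."""
    return (2.0 * math.sin(0.5)) ** -2


if __name__ == "__main__":
    print(f_hat_closed(0.1, 1.3), f_hat_quadrature(0.1, 1.3))
    print("c0 =", c0(), " 2*pi*c0 =", 2.0 * math.pi * c0())
```

With $\delta = 1/(2\pi T)$ the argument $\pi\delta t$ stays in $[-\tfrac12, \tfrac12]$ for $|t| \le T$, which is why the bound $2\sin\tfrac12$ is attained at the endpoints.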



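Following Gallagher we use Lemma 5.1 to prove (Gallagher [4],
Theorem 1)

Lemma 5.2. For $a_n \in \mathbb{C}$, let

$$\sum_{n=1}^{\infty} |a_n| < \infty. \tag{5.14}$$

Then, for $T \ge 1$,

$$\int_{-T}^{T} \Bigl| \sum_{n=1}^{\infty} a_n n^{-it} \Bigr|^2 dt \le c_1 T^2 \int_{0}^{\infty} \Bigl| \sum_{x<n\le xe^{1/T}} a_n \Bigr|^2 \frac{dx}{x} \tag{5.15}$$

holds with some absolute constant $c_1$.

Proof. In Lemma 5.1 we choose

$$\nu = \frac{1}{2\pi}\log n, \quad c(\nu) = a_n, \quad n \in \mathbb{N}, \tag{5.16}$$

and note that (5.3) is satisfied because of (5.14). Further, we put

$$\delta = \frac{1}{2\pi T}, \quad y = \frac{1}{2\pi}\Bigl(\log x + \frac{1}{2T}\Bigr) \quad (x > 0), \tag{5.17}$$

so that (5.4) is fulfilled and the condition of summation in (5.6) reads

$$-\frac{\delta}{2} < \nu - y < \frac{\delta}{2} \iff \log x < \log n < \frac1T + \log x. \tag{5.18}$$

Therefore (5.5) yields (with the above choice)

$$\int_{-T}^{T} \Bigl| \sum_{n=1}^{\infty} a_n n^{it} \Bigr|^2 dt \le 2\pi c_0 T^2 \int_{0}^{\infty} \Bigl| \sum_{x<n<xe^{1/T}} a_n \Bigr|^2 \frac{dx}{x}. \tag{5.19}$$

This completes the proof of the lemma. □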



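Now we shall give two applications of Lemma 5.2 for the averages
of the type (5.1).

Theorem 5.1. For $a_n \in \mathbb{C}$, let

$$\sum_{n=1}^{\infty} |a_n| < \infty. \tag{5.20}$$

Then, for $T \ge 1$,

$$\sum_{q\le Q} \frac{q}{\varphi(q)} \sum_{\chi \bmod q}^{*} \int_{-T}^{T} \Bigl| \sum_{n=1}^{\infty} a_n\chi(n)n^{-it} \Bigr|^2 dt \le 2c_1 \sum_{n=1}^{\infty} (TQ^2 + n)|a_n|^2, \quad \forall Q \in \mathbb{N}, \tag{5.21}$$

holds with the constant $c_1$ of Lemma 5.2.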

Proof. We use Lemma 5.2 with $a_n\chi(n)$ instead of $a_n$ and then apply

$$\sum_{q\le Q} \frac{q}{\varphi(q)} \sum_{\chi \bmod q}^{*} \tag{5.22}$$

to (the resulting) (5.15). Then, we have for the squared expression on
the right-hand side of (5.15), in the notation of Theorem 3.3, $M = [x]$
and $N \le x(e^{1/T} - 1) + 1$, so that using (3.23) it follows that

$$\sum_{q\le Q} \frac{q}{\varphi(q)} \sum_{\chi \bmod q}^{*} \int_{-T}^{T} \Bigl| \sum_{n=1}^{\infty} a_n\chi(n)n^{-it} \Bigr|^2 dt \le c_1 T^2 \int_{0}^{\infty} \bigl( Q^2 + 1 + (e^{1/T} - 1)x \bigr) \Bigl( \sum_{x<n\le xe^{1/T}} |a_n|^2 \Bigr) \frac{dx}{x}. \tag{5.23}$$

Herein, the factor of $|a_n|^2$ is

$$c_1 T^2 (Q^2 + 1) \int_{ne^{-1/T}}^{n} \frac{dx}{x} + c_1 T^2 (e^{1/T} - 1) \int_{ne^{-1/T}}^{n} dx = c_1 T(Q^2 + 1) + c_1 T^2 (e^{1/T} - 1)(1 - e^{-1/T})n \le 2c_1 (TQ^2 + n), \tag{5.24}$$

where we have employed the estimate

$$T^2 (e^{1/T} - 1)(1 - e^{-1/T}) \le 2 \quad \text{for } T \ge 1. \tag{5.25}$$

Now, putting together (5.23) and (5.24) we obtain (5.21). □
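The elementary estimate (5.25), read as $T^2(e^{1/T}-1)(1-e^{-1/T}) \le 2$ for $T \ge 1$, can be checked numerically. The sketch below uses `math.expm1` to avoid cancellation for large T; the function name is illustrative.

```python
import math


def g(T):
    """T^2 (e^{1/T} - 1)(1 - e^{-1/T}), computed with expm1 for accuracy."""
    return T * T * math.expm1(1.0 / T) * (-math.expm1(-1.0 / T))


if __name__ == "__main__":
    for T in (1.0, 2.0, 10.0, 1e3, 1e6):
        print(T, g(T))
```

The values decrease from $(e-1)(1-e^{-1}) \approx 1.086$ at $T = 1$ towards 1 as $T \to \infty$, so the bound 2 in (5.25) holds with room to spare.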
Theorem 5.2. Let $Q \in \mathbb{N}$. For $a_n \in \mathbb{C}$, let

$$\sum_{n=1}^{\infty} |a_n| < \infty, \tag{5.26}$$

and suppose that

$$a_n = 0 \text{ unless } (n, q) = 1 \text{ for all } q \le Q. \tag{5.27}$$

Then, for $T \ge 1$,

$$\sum_{q\le Q} \log\frac{Q}{q} \sum_{\chi \bmod q}^{*} \int_{-T}^{T} \Bigl| \sum_{n=1}^{\infty} a_n\chi(n)n^{-it} \Bigr|^2 dt \le 2c_1 \sum_{n=1}^{\infty} (TQ^2 + n)|a_n|^2 \tag{5.28}$$

holds with the constant $c_1$ of Lemma 5.2.

Proof. We proceed as in the proof of Theorem 5.1, but applying

$$\sum_{q\le Q} \log\frac{Q}{q} \sum_{\chi \bmod q}^{*} \tag{5.29}$$

instead of (5.22). Now the condition (5.27) permits us to use (3.5) of
Theorem 3.1 to obtain the same bound as in (5.23) for the left-hand side
of (5.28). So the proof is again completed by (5.24). □



NOTES 

It is possible to put Halász's method in an abstract form (cf. Montgomery
[2] (Lemma 1.7) and Huxley [7] (p. 115)). Gallagher and Bombieri
([3]) have observed that Bellman's inequality (0.59) contains both
the large sieve and the idea of Halász.

All the results of this chapter are due to Gallagher [4]. Clearly, estimates
corresponding to the other results of Chapter 3 can be derived
in the same way. For general results conforming to the theme of this
chapter and also for more sophisticated forms of the hybrid sieve, see
Montgomery |5j. Huxley [7], Gallagher Forti and Viola Q (cf. 
Bombieri and (g) (§5)), Huxley 0, Ol, Jutila d and Huxley fTO. 

(5.4): This condition can be relaxed to $\delta T \le 1 - \varepsilon$ for any $\varepsilon > 0$ and
then the constant $c_0$ of (5.5) depends, as can be seen from the proof (cf.
(5.13)), on $\varepsilon$.

(5.5): Since there is no need in applications, we do not aim at obtaining
the best possible values (for instance, by a different choice of $\delta$)
for the constants $c_0$ and $c_1$ in this chapter. However, just for a complete
form of the proof we obtain some permissible values for these constants.
From (5.13) and (5.4) we see that one can take

$$c_0 = \bigl(2\sin\tfrac12\bigr)^{-2} < 1.1. \tag{5.30}$$

Lemma 5.2: As Gallagher has stated, if the $a_n$'s are irregular,
Lemma 5.2 is more precise than Theorem 5.1, since in Lemma 5.2 the
coefficients are first smoothed and then squared.
(5.14): The condition (occurring in Theorems 5.1 and 5.2)

$$\sum_{n=1}^{\infty} |a_n| < \infty \tag{5.31}$$

of this chapter stems from Lemma 5.1. Further, our condition of Theorem
4.3, namely

$$\sum_{n=1}^{\infty} n|a_n|^2 < \infty, \tag{5.32}$$

need also be satisfied; for, otherwise, the theorems of this chapter hold
trivially true. Also, it may be noted from the examples

$$a_n = \frac{1}{n\log(n+1)} \quad \text{and} \quad a_n = \begin{cases} m^{-2} & \text{if } n = 2^m,\ m \in \mathbb{N},\\ 0 & \text{otherwise,} \end{cases} \tag{5.33}$$

that the conditions (5.31) and (5.32) are independent.

(5.15): From (5.19) and (5.30) we see that

$$c_1 = 2\pi c_0 < 7. \tag{5.34}$$



(5.25): For a proof of (5.25) it suffices to verify, by differentiation
and the use of $e^{1/x} \ge 1 + \frac1x$ $(x > 0)$, that the function $x^2(e^{1/x} - 1)(1 - e^{-1/x})$ is
decreasing in $x > 0$, so that we have

$$x^2 (e^{1/x} - 1)(1 - e^{-1/x}) = \bigl( x(1 - e^{-1/x}) \bigr)^2 e^{1/x} \le (e - 1)(1 - e^{-1}) \quad \text{for } x \ge 1, \tag{5.35}$$

which gives the upper bound in (5.25) because

$$(e - 1)^2 < 2e. \tag{5.36}$$



Chapter 6 



Applications of the Large 
Sieve 



THE SIGNIFICANCE of the large sieve is due to its usefulness towards
the solution of important problems of number theory. For this purpose
the large sieve is employed in two ways; namely, in proving results
which have number-theoretic consequences of depth and, on the other
side, for direct applications to number theory. We shall defer the discussion
of the (latter) arithmetical version of the large sieve to the next
chapter and confine ourselves here to a brief survey of the (aforementioned)
indirect applications.

1. Moments of the Dirichlet's L-series 

In this section we mention the applications to the moments of
Dirichlet's L-series. For the historical introduction to this topic we refer
to Montgomery [5] (Chapter 10).

Gallagher [1] has shown that the large sieve (in its version of Chapter
3) can be used to prove



q<Q x mod 1 

Montgomery ([5] (Lemma 10.5)) reduced the problem of estimating
the mean fourth power of $L(s,\chi^*)$, using the work of Lavrik on approximate
functional equations for Dirichlet's L-series, to an application
of the results (3.36) and (3.37), and obtained ([5] (Theorem 10.1))

T 

J] f \L(<r + it, X \ A dt « <p(q)Tlog\qT) for |cr- I| « log' 1 (qT), T>2, 

(6.2) 

which may be considered as the average-version of the generalized Lindelöf
hypothesis:

$$L(\tfrac12 + it, \chi) \ll_{\varepsilon} (q|t|)^{\varepsilon}, \quad |t| \ge 1. \tag{6.3}$$

Then he proceeds to derive easily ([5] (Corollary 10.2))

* T 

£ f + it,*)*.'(i + ^^)| 2 ^ « <p(q)T log 6 (qT), T>2. (6.4) 

^-modij_2. 

2. Density theorems. 

The next important applications of the large sieve (employed in its
hybrid version of Chapter 5 along with many other ingenious ideas) concern
the 'statistical density theorems' for the zeros of Dirichlet's
L-series and, in particular, of the Riemann zeta function. For the history
of this subject, see Montgomery [5] (Chapter 12).

We recall the following standard notation required for the description
of the results of this section. As usual,

$$N(\sigma, T, \chi) \quad (\sigma \le 1) \tag{6.5}$$

denotes the number of zeros

$$\rho = \beta + i\gamma \tag{6.6}$$

of the function

$$L(s, \chi) \tag{6.7}$$

in the rectangle

$$\sigma \le \beta \le 1, \quad |\gamma| \le T. \tag{6.8}$$

Particularly for the Riemann zeta function, i.e., q = 1, we use

$$N(\sigma, T) \tag{6.9}$$

instead of (6.5).

Regarding the average over $\chi$ of (6.5), for a fixed q, the best known
results at present are the following. We have



2 N(o;T,x)^ 



X mod ^ 



\qT)^^log 9 (qT),\ <cr< f 

( Montgomery [5]( Theorem 12.1)), 
' (qT)^ l -< T)+E , I < a- < |( Huxley [12]), (e > 0). 

( ? r)(2+s)(l-<r) ) 4 < o- < 1( Jut j la [ 12 ]) 

(6.10) 

(Here and in what follows, the $\ll$-constant is understood to depend on
$\varepsilon$ whenever the bound contains $\varepsilon$.) It is easily deduced from (6.10) that

£ N(o~ , r,*) « (qT)^^, X - < a < 1, ( £ > 0), (6.11) 

^ mod 

holds uniformly in $\sigma$, $q \ge 1$ and $T \ge 2$. And, for the average over q we
have



^ £ iV((7-,r,^)« 



g< ^ mod y 



(e 3 r)^<'-»iog 9 ( g r),i<^< | 



( Montgomery [5](Theorem 12.2)) 
(Q?T)*>*=i < - 1 -< r)+£ , | < o- < f ( Huxley [12]), (e > 0). 
(g^) 2 ^ 1 ^, | < o- < 1( Jutila [12]), 

from which one gets the estimate analogous to (6.11)



(6.12) 



* 1 
^ 2 N(cr, T, X ) « (^D^+^X 1 --), - < cr < 1, ( £ > 0) (6. 13) 

valid uniformly in $\sigma$, $Q \ge 1$ and $T \ge 2$.

The function $N(\sigma, T)$ has been investigated more extensively. Estimates
of the form

$$N(\sigma, T) \ll T^{2(1-\sigma)+\varepsilon} \quad (\varepsilon > 0), \tag{6.14}$$

valid in $\sigma \ge \alpha$ (for some $\alpha$) uniformly, are called 'density hypothesis'.
It is known that the Lindelöf hypothesis:

$$\zeta(\tfrac12 + it) \ll (1 + |t|)^{\varepsilon} \quad (\varepsilon > 0) \tag{6.15}$$

implies (6.14), with $\alpha = \tfrac12$. For some applications proved under the
assumption of (6.15) it suffices instead to have a result of the type (6.14).
Now the density hypothesis is known for

$$\sigma \ge \frac{11}{14} = 0.78571\ldots \tag{6.16}$$



and this result is due to Jutila 0, 021 (cf. E3). In cr > f| still better 
estimates are available; namely, 



N(cr, T)<T 



A(a-)(l-a-)+e 



(£>0) 



(6.17) 



with 



A(cr) = 



?f§F=T,%<<r<%( Huxley [11]), 
^,§<cr<^l( Huxley [11]), 



37, 



4(3<x-2) 37+ V73 



< cr < 1 



3(4cr-3)(2o--l)' 48 

(Montgomery [5]( Corollary 12.4)). 



(6.18) 



Close to the line cr - 1, we have (Halasz and Turan [ 1 ]) even 



N(cr, T) « T 

6 



log 3 (- 



1 



1 -cr 

and (Montgomery O (Corollary 12.5)) 

,167(l-o-) 3 / 2 ,17 



), cr > 1 - 5 (6 > 0) 



7V(cr,D « r 1 



log'T,^^ -,T>2. 
2 



(6.19) 



(6.20) 



On the other side, in the vicinity of cr = ^ and for (^ <)cr < | still 
the best known density estimates are due to A. Selberg and Ingham (cf. 
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_ _ 3 

Montgomery ]5| (Chapter Hzt ). For o~ between - and Jutila's bound 

<I6- 1 6I >. the best known estimate (16.171) with 

/l(cr) = 3(minmax , ), — < a < — 

(3o- _ i) + |(i _ o-) 2fc(4o--3) + 3(l-o-) 4 14 

(6.21) 

is also due to Jutila [10]. 

The connection between the order of $\zeta(s)$ and the density estimates
has already been indicated (cf. (6.14) and (6.15)). Indeed, some of the
aforementioned bounds for $N(\sigma, T)$ can be improved slightly by using
better estimates for $\zeta(s)$. For general results in this context we refer to
Bombieri [3] (and for L(s,x) to Forti and Viola (Q). 

3. Mean-value Theorems of the Bombieri type. 

In this section and the next we mention the applications, involving
number-theoretic functions, which have important consequences in
(proper) number theory. As regards notation, almost all of it is standard,
and so we recall only one item, namely the Hurwitz zeta function,
defined through

$$\zeta(s, w) = \sum_{n=0}^{\infty} (n + w)^{-s}, \quad s = \sigma + it \ (\sigma > 1),\ 0 < w \le 1, \tag{6.22}$$

and analytic continuation.

Now, one of the most important applications of the large sieve has
been to what we shall call Bombieri's prime number theorem
(Bombieri): For any given number $U$ ($> 0$) there exists a value
$C = C(U)$ such that

$$\sum_{q \le x^{1/2}\log^{-C}x}\ \max_{2 \le y \le x}\ \max_{(\ell,q)=1} \left|\pi(y;q,\ell) - \frac{\operatorname{li} y}{\varphi(q)}\right| \ll_U \frac{x}{(\log x)^{U}}, \tag{6.23}$$

or equivalently

$$\sum_{q \le x^{1/2}\log^{-C}x}\ \max_{2 \le y \le x}\ \max_{(\ell,q)=1} \left|\psi(y;q,\ell) - \frac{y}{\varphi(q)}\right| \ll_U \frac{x}{(\log x)^{U}}. \tag{6.24}$$

A result of this kind can be derived from the large sieve, either via
estimates of the type (6.12) or directly. Bombieri proved (6.24) (via a
(6.12)-type result) with the value $C = 3U + 23$, and the best known
result now is with

C = U + -, (6.25) 

due to Vaughan [6], who obtained this by a refinement of Gallagher's
method (a direct application of the large sieve) for a proof of (6.24).
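
The quantity averaged in (6.24) can be computed directly for small moduli. The following is a minimal sketch, not part of the original lectures: the function names are illustrative, the inner maximum over $y$ is dropped (only $y = x$ is used), and the range of $q$ in the demo is an arbitrary choice rather than the range $q \le x^{1/2}\log^{-C}x$ of the theorem.

```python
from math import gcd, log


def von_mangoldt_table(x):
    """Return a list lam with lam[n] = Lambda(n) for 0 <= n <= x."""
    lam = [0.0] * (x + 1)
    is_composite = [False] * (x + 1)
    for p in range(2, x + 1):
        if not is_composite[p]:
            for m in range(p * p, x + 1, p):
                is_composite[m] = True
            pk = p
            while pk <= x:  # Lambda(p^k) = log p
                lam[pk] = log(p)
                pk *= p
    return lam


def psi_progressions(x, q, lam):
    """Return [psi(x; q, l) for l = 0, ..., q - 1]."""
    totals = [0.0] * q
    for n in range(1, x + 1):
        totals[n % q] += lam[n]
    return totals


def euler_phi(q):
    return sum(1 for a in range(1, q + 1) if gcd(a, q) == 1)


def max_error(x, q, lam):
    """max over (l, q) = 1 of |psi(x; q, l) - x / phi(q)| (only y = x)."""
    phi = euler_phi(q)
    totals = psi_progressions(x, q, lam)
    return max(abs(totals[l] - x / phi) for l in range(q) if gcd(l, q) == 1)


if __name__ == "__main__":
    x = 20000
    lam = von_mangoldt_table(x)
    # Illustrative range of moduli only.
    total = sum(max_error(x, q, lam) for q in range(1, 51))
    print(f"sum over q <= 50 of the maximal error at x = {x}: {total:.1f}")
```

Comparing the printed sum with $x$ gives a rough feeling for how small the averaged error is, although no finite computation says anything about the asymptotic statement itself.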
73 Jutila 1 1 ] has proved a corresponding result for short intervals which 
states that 

ZZ x e 
max max \\l/{x+z\q,€)-Mx;q, €) 1 <c :t,O<0<1. 

q<yr 

(6.26) 

where 

Ace + 29-1- Ac 

fi = m4 = 7— A e (6-27) 

6 + Ac 

in which $c$ denotes a constant satisfying, for the function in (6.22),

$$\zeta(\tfrac12 + it, w) \ll_{\delta} (1 + |t|)^{c+\delta} \quad \text{for every } \delta > 0. \tag{6.28}$$



For the various (more or less equivalent) forms of (6.23) and (6.24)
we refer the reader to Elliott and Halberstam [2] and Montgomery [5]
(Chapter 15). As an example, we mention an interesting remark of
Montgomery and Vaughan in connection with the following result
derived from (3.4)

A7' i * 

2 log(— ) Yj l<AW*)l 2 <^V 2 log^V, N>N Q , (6.29) 



q 

i x m °d i 



where (cf. (1.75))

$$\psi(N, \chi) := \sum_{n \le N} \Lambda(n)\chi(n). \tag{6.30}$$



Now the term corresponding to q = 1 in ( 16.291 contributes already 
-N 2 logN + o(N 2 ) to the sum. Further, if L(s,x\) has a Siegel - zero, 
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q\ = N 6 , then \tp{N,x\)\ > (1 - S)N and consequently the contribution 

<\ 

■ 2 



from the term for x\ m ( 16.291 ) is atleast (A - 26)N 2 log Af so that 



2 2 l<A(A^)| 2 «<W 2 logjV, (6.31) 

which seems rather unlikely to be true for sufficiently small $\delta$.

There are results, analogous to (6.23), concerning the average order
of remainder terms with respect to other number-theoretic functions.
From these results one has also been able to obtain results which could
previously be proved only either by the use of Linnik's complicated
dispersion method or under the assumption of the generalized Riemann
hypothesis.

Such analogues are now available for the functions dk(ri) (A.I. Vino- 
gradov [1 1, Motohashi O, Q), tin) (Motohashi [2], Siebert and Wolke 
fT10 and for certain powers of these functions. Certain other s pecial 
functions have also been investigated (Siebert and Wolke [ 1 ], see (I6.33t 
below, and Wolke [7]) and interestingly we have now general results 
of the type (I6.23t . based on the large sieve, due to Wolke (cf. Wolke 
l5l . 151 . Siebert and Wolke [1]): Under certain conditions (stemming 
from the work of Wirsing) for a multiplicative number-theoretic func- 
tion f(n), one has 

V maxmaxl Y f(n) - —— Y f(n)\ « - - w , , U > 0,C = C(U). 

q<x~l- log c * rmtmoiq (n,q)=l 

(6.32) 

As an example, we mention the following consequence of ( I6.32t 
(Siebert and Wolke Q): 



(logx) 



, U > 0, C = C(U). 



l , _ c n<y 
^• t21 °g x n={modq 

(6.33) 

Orr Q (cf. El) has derived such a result for the number of square- 
free integers (i.e., /J. 2 (n)) in an arithmetic progression in an elementary 
way. For another elementary derivation of such results from a different 
type of mean- value theorems we refer to the next section. 
4. Mean-value Theorems of the Barban-Davenport-Halberstam type.



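There is another type of mean-value theorem corresponding to (6.24)
which deals with the mean square instead and has a considerably
wider range of validity (for $q$):

$$\sum_{q \le x\log^{-(U+1)}x}\ \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \left(\psi(x;q,\ell) - \frac{x}{\varphi(q)}\right)^{2} \ll_U \frac{x^{2}}{(\log x)^{U}} \quad \text{for } U > 0. \tag{6.34}$$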



Such a result was found (in a slightly weaker form) first by Barban
([6] (Theorem 1), cf. [10] (Theorem 3.2)) and was later rediscovered
by Davenport and Halberstam. The improved form (6.34) is due to
Gallagher [1]. The proof is based on the Siegel-Walfisz theorem and the
identity (Davenport and Halberstam [2]) (1.76).

q x 1 

£ '0A(x; q, €) - -^-) 2 = — J] \i/s{x, X )\ 2 , (6.35) 

f=i x*xo 

and an application of the large sieve in its form of Chapter 

Montgomery ([4], cf. [5] (Chapter 17)) discovered a proof of (6.34)
independent of the large sieve and also succeeded in obtaining the
asymptotic formulae:



q<Q t=\ YyHJ 



Qx log Q + o(Qx + x 2 log x), 

for Q < x, ( for any fixed U < 0) 



Qxlogx- 



f(6) ' 



: log 7- 



Qx + Ax 1 + o{Qx\og x), 
for Q > x. 



(6.36) 



The part Q < x of ( I6.36t is an improved version of Montgomery's 
first result (Croft □). 

Montgomery's method of proof of (6.36) is based on a deep theorem
of Lavrik [1], [2], about the average order of the error-term in the
generalized twin-prime problem, which may be stated as



Yj Yj Mn)A(n-2q)-2(x-2q)Y\(l- 



1 



q<j 2q<n<x 



p>2 



(p-i) 



»>n£5>** - 



2<p\q ■ 



log, 
(6.37) 
and which in turn depends on the method of I.M. Vinogradov for the
estimation of exponential sums.

Regarding the first part ($Q \le x$) of (6.36), Hooley [5] has shown that
one can replace the error term by

$$AQx + O\big(Q^{5/4}x^{3/4} + x^{2}\log^{-U}x\big) \tag{6.38}$$

as an application of the (simpler) large sieve method only.

Hooley [8] has also proved, on the basis of the large sieve method
(of Chapter 3), the general result of the Barban-Davenport-Halberstam
type: Let $\mathcal{Y}$ be a set of positive integers and suppose that there holds, for
all $U > 0$ and all integers $q$, $\ell$ (for some function $g(q, b)$),

$$S(x;q,\ell) := \sum_{\substack{n \in \mathcal{Y}\\ n \le x\\ n \equiv \ell \bmod q}} 1 = g(q,(\ell,q))\,x + O_U(x\log^{-U}x) \quad \text{as } x \to \infty. \tag{6.39}$$

Then

$$\sum_{q \le Q}\ \sum_{\ell=1}^{q} \big(S(x;q,\ell) - g(q,(\ell,q))\,x\big)^{2} = O(Qx) + O(x^{2}\log^{-U}x)$$

for $1 \le Q \le x$ (for any fixed $U$). (6.40)
Combining this with the method of his paper [ 5 ] he derived that 

YjYj( M")) 2 = —Q x + °( x2 l °g~ u x ) for 1 ^ x ' ( for an y fixed f 7 )- 

9<G '=1 

n=/ mod t/ 

(6.41) 

An interesting connection between the mean-value theorems of the
squared expression and those of the preceding section was noted by
Barban [6]. He observed that from a mean-value theorem of the type (6.40)
it is possible to derive in an elementary manner a Bombieri-type result
for a related function (depending on the parameter $x$). Here we shall
present a proof in the simplest case: Let us put

$$\psi_2(x;q,\ell) := \sum_{n_1 \le \sqrt{x}}\ \sum_{\substack{n_2 \le \sqrt{x}\\ n_1 n_2 \equiv \ell \bmod q}} \Lambda(n_1)\Lambda(n_2), \tag{6.42}$$

$$E_2(x;q,\ell) := \psi_2(x;q,\ell) - \frac{x}{\varphi(q)}, \tag{6.43}$$

$$E_1(x;q,\ell) := \psi(x;q,\ell) - \frac{x}{\varphi(q)}, \tag{6.44}$$

and assume that

$$(\ell, q) = 1. \tag{6.45}$$



We have 



n\=hr l lmoAq 

= ' \p{ y[x; q, h)t//( yfx; q, h~ l l) 

h=i 



ni=hmoiq 



(6.46) 



(here $h^{-1}$ represents the residue class mod $q$ for which $h h^{-1} \equiv 1 \bmod q$)
and



2 'GK ^^-^r\ ^ V^) - V^l + log (6.47) 



0(9) 



Subtracting $\frac{x}{\varphi(q)}$ from both sides of (6.46) and using (6.47) we
obtain, by Cauchy's inequality,



E 2 (x;q,l) < Ya ' Ei{yJx,q,h)E\{^x,q,h ~ I) 



h=\ 



+2^g{|^(Vi)- V3 + log?}< 
£'El(^;q,b) + 2j£ {|<K V?) - \£| + log q] 



(6.48) 

uniformly in $\ell$ subject to (6.45). Now summation over $q$ and the use
of (6.34), along with an application of the prime number theorem in the
form

$$\psi(\sqrt{x}) - \sqrt{x} \ll \frac{\sqrt{x}}{(\log x)^{U+1}}, \tag{6.49}$$

give



V max V V A(m)A(n 2 )- 



' i) (/ ' ?)=1 <r <r ^ ( lo § x ) 

g<V^log l '-v ni<-\jxn 2 <\x 

(6.50) 

Thus we have shown that this Bombieri-type result is an elementary
consequence of (6.34).
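
The decomposition (6.46) on which this argument rests can be checked numerically. The sketch below is an illustration only (the function names are ours, and the argument `x_sqrt` stands for $\sqrt{x}$): it evaluates $\psi_2(x;q,\ell)$ once directly from the definition (6.42) and once as the sum over reduced classes $h$ of $\psi(\sqrt{x};q,h)\,\psi(\sqrt{x};q,h^{-1}\ell)$, assuming $(\ell,q)=1$ and $q \ge 2$.

```python
from math import gcd, isclose, log


def von_mangoldt_table(limit):
    """Return a list lam with lam[n] = Lambda(n) for 0 <= n <= limit."""
    lam = [0.0] * (limit + 1)
    is_composite = [False] * (limit + 1)
    for p in range(2, limit + 1):
        if not is_composite[p]:
            for m in range(p * p, limit + 1, p):
                is_composite[m] = True
            pk = p
            while pk <= limit:
                lam[pk] = log(p)
                pk *= p
    return lam


def psi2_direct(x_sqrt, q, l, lam):
    """Sum of Lambda(n1) * Lambda(n2) over n1, n2 <= sqrt(x), n1 * n2 = l mod q."""
    return sum(
        lam[n1] * lam[n2]
        for n1 in range(1, x_sqrt + 1)
        for n2 in range(1, x_sqrt + 1)
        if (n1 * n2 - l) % q == 0
    )


def psi2_via_classes(x_sqrt, q, l, lam):
    """Right-hand side of (6.46); requires gcd(l, q) == 1 and q >= 2."""
    psi = [0.0] * q  # psi[h] = psi(sqrt(x); q, h)
    for n in range(1, x_sqrt + 1):
        psi[n % q] += lam[n]
    total = 0.0
    for h in range(1, q):
        if gcd(h, q) == 1:
            h_inv = pow(h, -1, q)  # modular inverse, Python 3.8+
            total += psi[h] * psi[(h_inv * l) % q]
    return total


if __name__ == "__main__":
    lam = von_mangoldt_table(100)
    print(psi2_direct(100, 7, 3, lam), psi2_via_classes(100, 7, 3, lam))
```

The two values agree up to floating-point rounding, because $(\ell,q)=1$ forces $(n_1,q)=1$ in every term of (6.42).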

5. Some number-theoretic applications. 

In this last section we merely record a few applications of the
preceding results to pure number theory.

First of all we have the important consequence about the difference
between consecutive primes that

$$p_{n+1} - p_n < p_n^{\delta+\varepsilon} \quad \text{for } n \ge n_0(\varepsilon),\ \varepsilon > 0, \tag{6.51}$$

with

$$\delta = 1 - \frac{1}{A} \tag{6.52}$$

whenever there is a result (6.17) with uniform $A$ valid in $\tfrac12 \le \sigma \le 1$
(cf., for the history of this question, Montgomery [5] (Chapter 14)).
Therefore the density hypothesis (6.14) with $a = \tfrac12$ would give (6.51)
with $\delta = \tfrac12$. Montgomery [3] proved (6.51) with $\delta = 3/5$ and Huxley
151 (cf. Q (P- 1 19)) succeeded in getting the best known value 

$$\delta = \frac{7}{12}. \tag{6.53}$$



(Now, in view of our initial statement, we can get (6.53) from either of
(6.11) or (6.13) also.)

An analogous application of ( 16. 10l > is known with respect to the least 
prime p\(q, i) in the arithmetic progression {{, I + q, I + 2q, . . .}, {€, q) = 
l,0<£ <q. This stated (Iwaniec El) 

Pl(qJ)« £ , lj{q) q A+£ ,e>0, (6.54) 
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so that one has from Jutila's result ( 16.111 ) that 

Pl(q,l)« e , q(q) q^ + ",€>0. (6.55) 

(However, observe that the $\ll$-constant depends on the kernel of $q$ (cf.
(1.19)). Here, with respect to Linnik's famous theorem, the best known
exponent in (6.54) with $\ll$-constant independent of the kernel of $q$ is 550,
proved unconditionally by Jutila.)

Regarding the analogues, for other arithmetical functions, of the
Bombieri theorem as well as of the Barban-Davenport-Halberstam theorem
mentioned earlier, we have their applications in the proof of various
delicate asymptotic formulae, involving such functions as the divisor
function $d_k(n)$, $r(n)$, $\mu(n)$ etc., as also general functions fulfilling certain
conditions, their powers and some mixed forms of these functions, and
further, with the argument running through certain polynomial sequences.
For these problems we refer to Barban [6], [10], Elliot and Halbertam 
fTl . Hooley (1 ], [6|, Huxley and Iwaniec [ 1 ]. Indlekofer [ 1 ], [2], Iwaniec 

0, 0, Katai Linnik H, Motohashi O, 0, H, 0, Orr 0, 0, 
Proter 0. Q, Rodriquez [ 1 ], Siebert and Wolke Q, Vaughan H, 0, 
A.I. Vinograndov 0, and Wolke 0, 0, 0. 

NOTES 

1.: Montgomery and Vaughan [1] have shown that the proof of the
classical formula

$$\int_0^T |\zeta(\tfrac12 + it)|^2\,dt = T\log T + O(T) \tag{6.56}$$

can be greatly simplified by an application of Theorem 4.3.
Ramachandra [6], [7] has extended this to various other moments with
respect to $\zeta(s)$ and $L$-functions. Such results are important for the
results mentioned in the second section of this chapter (as has already
been indicated in the context of (6.15)).

(I6.lt . (I6-2I >: For similar results involving \L(- + it,x)\ 2 see Gallagher 
More general forms, with an averaging over certain well-spaced 
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?-sets of points, can be obtained by an additional use of d6.2t (see 
Montgomery [5|) (Theorem 10.3 and Corollary 10.4), and Huxley 80 
171 (pp. 97,108)). Such results can be found in Ramachandra [3] for 

in - < cr < 1, for L(s,x) in Ramachandra and Jutila fTOl. 
and with^mod p for L{s,x) in Elliott [9]. 

( 16.21) : A simple proof of (16.21) has been given by Ramachandra [7 ]. 
For a result of this type with real characters only see Jutila [7 ]. 

(6.4): For a mean-value theorem for $|L'(\tfrac12 + it, \chi)|^2$ which can be
derived in the same way, see Vaughan [6].

2.: (16.51) : Gallagher [ 8 ] has proved that one has 

N(cr, T,x) « T 3 ^^ log c T, for q < T (6.57) 

and 

2 (N(cr, T + \,x)- N(cr, T, X )) « q 3 ^ log c T, for T < q 

Xmodq 

(6.58) 

with some constant C. 

For results about the size of L(l,^-),^mod p, we refer to the papers 
of Bareman, Chowla and Erdos O, Barban (S|, fM- Elliott O. 
Joshi [ 1 ] and for any L(s,x) to Elliott [ 8 ]. 

(6.10), (6.12): For estimates of these averages, under certain
restrictions on $q$ and $Q$, respectively, see Ramachandra [4]. For earlier
results, in particular, regarding the 'generalized density hypothesis'
(namely, the estimates (6.10), (6.12) with the exponent $2(1 - \sigma) + \varepsilon$
instead), see Balasubramanian and Ramachandra [1], Huxley [9],
Huxley and Jutila [1], and Jutila [5], [9], [10], [11].

( 16.121 : For a result, with the summation restriction to real charac- 
ters only, which has better estimates in some cases, see Jutila [4], 
jSJ. A. Selberg [6], cf. Montgomery [8]) had proved (using his 81 
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results d3~T4l and CT ) that 

^ 2 AT(cr, T,X) « e (QfT 3 )^ 1 -^, X - < <r < 1(6 > 0). 

mod 4 

(6.59) 

In this connection we have also the following deep result due to
Bombieri ([6] (Theorem 14)): If there is a 'Siegel-zero' $\beta_1$ (relative
to $T \ge 2$ and a certain constant $c_0 > 0$) then there holds

2 2 ^^^((l-^Ologr)^ 1 ^ (6.60) 

g<r^mod</ 

with some absolute and effective $c$ and $\ll$-constants, where on the
left-hand side the exceptional zeros are not included. As an application
of this Bombieri [6] derives the well-known theorem of Siegel:
One has

$$1 - \beta_1 \ge c(\varepsilon)\,T^{-\varepsilon}, \quad T \ge 2 \ (\varepsilon > 0), \tag{6.61}$$

with (as in all other known proofs) an ineffective $c(\varepsilon) > 0$. We
briefly sketch this deduction. Introducing $\theta$ as the supremum of the
real parts of the zeros of all the Dirichlet $L$-functions, we see easily
that one can suppose (for the purpose of (6.61)) $\theta = 1$. Now, taking
$T$ to satisfy

$$T \ge \max\Big(q_0,\ |\gamma_0|,\ \exp\Big(\frac{c_0}{1 - \beta_0}\Big)\Big), \quad \beta_0 > 1 - \varepsilon, \tag{6.62}$$

where $q_0$ is the (least) modulus of the $L$-function which has a zero
$\rho_0 = \beta_0 + i\gamma_0$ (with $\beta_0 > 1 - \varepsilon$), we see that $\rho_0$ is not an exceptional
zero (relative to $T$ and $c_0$) and also that the left-hand side of (6.60)
with $\sigma = 1 - \varepsilon$ is $\ge 1$. Hence we have

(i-A)»((iogr)rT 1 (6.63) 

which gives (6.61) for $T$ subject to (6.62) and so for all $T \ge 2$.
(6.14): Halász and Turán [1] have shown that (6.15) gives (even)

$$N(\sigma, T) \ll T^{\varepsilon}, \quad \text{for } \sigma \ge \tfrac34 + \delta \ (\text{with } \varepsilon > 0). \tag{6.64}$$
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Also they have proved in that the generalized Lindel of hypoth- 
esis, 

L(s,x) «: uniformly for cr > \, \t\ < T, (6.65) 
€ t 2 

implies the uniformly estimates, for A6.12I ). 

1 3 
<K g 4 < 1 - £r > +e for - < cr < 1 and <s; g e for cr > - + 5(e >o,5> o). 

£ T 2 c t,6 4 

(6.66) 

J6.16t : In this connection, for earlier results, see Montgomery [3], 
Huxley [6], Ramachandra [8], Forti and Viola [2], Huxley [9], [11], 
d, and Jutila H. 

d6.17t : Ramachandra O has proved for cr > — that 

Af(cr, T + T 5/12 ) - N(cr, T) « T^ 1 -^, (e > 0) (6.67) 

6 

and more generally, Balasubramanian [ 1 ] has shown that, uniformly 
1 

in cr > - 
2 

N(cr, T + H) - N(cr, T) « HttQ-*) log 100 H for >H> T. 

(6.68) 

(6.21): This type of exponent in (6.10) and (6.12) occurs also in
Ramachandra [8], and Balasubramanian and Ramachandra [1].

(6.23), (6.24): The first result of this type is due to Rényi [2].
Replacing the sum on the left by

$$\sum_{q \le x^{\alpha}} \tag{6.69}$$

Barban [2] improved Rényi's result (i.e., with a certain small $\alpha > 0$)
to $\alpha = \tfrac16 - \varepsilon$ (for every $\varepsilon > 0$) and succeeded then ([4], [5], [6]) in
extending it to all $\alpha < 3/8$. Since the value of the result increases
considerably with $\alpha$, Bombieri's theorem has led to most important
applications. A.I. Vinogradov [1] proved the same theorem (though
in a slightly weaker form, namely, with $\alpha = \tfrac12 - \varepsilon$ for every $\varepsilon > 0$)
simultaneously. A result of this type with $\alpha > \tfrac12$ would have
important consequences (cf. Buchstab [2], Halberstam, Jurkat and
Richert) and it has been conjectured by Elliott and Halberstam
[2] (cf. Elliott [6]) that every value of $\alpha < 1$ is admissible.

(6.25): Gallagher [2] uses in his proof of (6.24) the decomposition

$$\frac{L'}{L} = (1 - LG)^2\,\frac{L'}{L} + 2L'G - LL'G^2,$$

where $G$ is a partial sum of the Dirichlet series for $\frac{1}{L}$, instead of
which Vaughan [6], to obtain (6.25), uses the more efficient splitting

$$\frac{L'}{L} = \Big(\frac{L'}{L} + F\Big)(1 - LG) + (L' + LF)G - F,$$

with $F$ being the partial sum corresponding to $-\frac{L'}{L}$.

(6.26): For a different proof of (a slightly weaker form of) (6.26)
see Motohashi [5]; more recent results which in certain ranges (for
$\theta$) improve (6.26) are due to Huxley and Iwaniec [1] (according
to Montgomery (oral communication), S. Ricci, a student of his, has
obtained a similar result).

(6.28): The best exponent known here,

$$c \le \frac{173}{1067} < \frac16, \tag{6.70}$$

is due to Kolesnik [1].

(6.29): For similar results, see Bombieri and Davenport [2],
Montgomery [5] (Chapter 15), and Vaughan [6].

(6.32): In this context, we mention that the function $r(n)$ which
appears in this and the last section of this chapter, and for which
(6.32) also holds, denotes the general

$$r(n) := \sum_{d|n} \chi(d), \quad \chi \bmod q,\ n \in \mathbb{N}; \tag{6.71}$$

in particular, when $\chi$ is the non-principal character to the modulus
4, $r(n)$ denotes the number of representations of $n$ as a sum of two
squares (apart from a factor of 4).
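
The special case mentioned at the end of this note is easy to test by machine. The sketch below (an illustration with names of our own choosing) evaluates (6.71) for the non-principal character mod 4 and compares $4\,r(n)$ with a brute-force count of the ordered integer pairs $(a, b)$ satisfying $a^2 + b^2 = n$.

```python
from math import isqrt


def chi4(d):
    """The non-principal character modulo 4."""
    return (0, 1, 0, -1)[d % 4]


def r_general(n):
    """r(n) of (6.71) for chi = chi4: the sum of chi4(d) over the divisors d of n."""
    return sum(chi4(d) for d in range(1, n + 1) if n % d == 0)


def representations_as_two_squares(n):
    """Number of ordered pairs (a, b) of integers with a*a + b*b == n."""
    count = 0
    bound = isqrt(n)
    for a in range(-bound, bound + 1):
        b_squared = n - a * a
        b = isqrt(b_squared)
        if b * b == b_squared:
            count += 1 if b == 0 else 2  # b and -b are distinct unless b == 0
    return count


if __name__ == "__main__":
    for n in (1, 2, 5, 25, 65):
        print(n, r_general(n), representations_as_two_squares(n))
```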

4.: d6.36t : For stronger and more general results, and also for those 
which can be obtained on the assumption of the generalized Rie- 
mann hypothesis, see Hooley 0, Q, 0. The corresponding prob- 
lem for 

2 fin) ( 6 - 72 ) 

n>x 
n=l mod q 

has been investigated extensively, both with the help of the large 84 
sieve as well as by elementary methods only (Orr [ 1 ] (cf. Orr [2]). 
Warlimont IB, 0, and Croft □]). Motohashi [6] has used the idea 
of Montgomery's proof, namely without using the large sieve, to 
obtain an asymptotic formula with respect to 

2 d{n) (6.73) 

n<x 
n=l mod q 

d6.50t : Barban (El (Theorem 3.3)) has extended his result also to 
the more general sums 

2 (6 " 74) 

Pi<x 
i=l,...,k 
pi...pic=l mod q 

where 

< a\ < . . . < a k , ai + ■ ■ ■ + a k = 1, k > 2, (6.75) 
of which ( 16.501 ) corresponds to the case 

k = 2; a\ = - —. (6.76) 

Orr [1], [2] has obtained results, with sums of the type (6.74), where
primes are replaced by squarefree numbers, and also with mixed
products instead.

For results of other types concerning short intervals we merely
mention two recent papers: Gallagher [5], and Ramachandra [10].
5.: (6.51): The history of the more general question of the number of
primes in short intervals

$$[x,\ x + x^{\delta}]$$

starts with Hoheisel, who was the first to prove a result with a $\delta < 1$,
actually with any $\delta > 1 - (33000)^{-1}$. Another famous question,
concerning small differences of primes, is to find an estimate of the
form

$$\liminf_{n \to \infty} \frac{p_{n+1} - p_n}{\log p_n} \le \theta \quad (p_n:\ n\text{th prime}). \tag{6.77}$$

It is conjectured that this lim inf is zero, an obvious consequence of the
twin-prime conjecture, and it has been known that, under the assumption
of the generalized Riemann hypothesis, one would have (6.77) with

9 = — . However, Bombieri and Debenport obtained (16.771) uncondi- 
tionally with a 9 < ^, in fact, with 9 = -(2 + V3) = 0.46650 . . ., where 

8 

they used among other ideas the large sieve. Now we have d6.77t . due 
to Huxley EH, with 

9 = — ^ - 0.44634 . . . (6.78) 
16 

For earlier results generalizations and allied questions see, for in- 
stance, Huxley [2], Uchiyama [ 1 ], Moreno [ 1 1, and Wolke fTTI. 

(I6.54t : Forg - //,see Gallagher [5]: for previous result and further 
references see Jutila |5|. 

(16.55b : The corresponding problem for general sequences, in par- 
ticular for squarefree numbers, was dealt with by Wolke ifTl . 121 as an 
application of (2.103) (see also Warlimont [2]). 

We have more application for the result of Section 2., as also of those 
in the notes to that section. First of all there is the question of the least 
character non-residues for which we refer to Montgomery [5] (Chapter 
n"3l. and Elliott 0. For estimations of character sums, see Montgomery 

86 (Chapter [TJ), Gallagher H |D, and Jutila (as also Jutila 0). 
Regarding primitive roots and Artin's conjecture we mention the papers 
of Gallagher Q. Burgess and Elliott 0, Goldfeld 0, Elliott 0, 0, 
Vaughan [1], and for some related questions, Gallagher 0, 0, and 
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Goldfeld [2]. Finally, for the applications to the problem of the largest 
prime factor of numbers we refer to the papers of Goldfeld [2], Hooley 
0, El, Jutila 0, Motohashi 0, (TT), and Ramachandra 0,0. 



Chapter 7 



The 'Sieve Form' of the 
Large Sieve 



AS HAS been mentioned in the beginning of the preceding chapter, we
consider now the (direct) general application of the large sieve in its
arithmetical form, in the sense of B. of Chapter |U| Let us recall the 
situation described there. Consider the set of integers in an interval
$(M, M + N]$. For each prime $p$ (belonging to a certain set $\mathcal{P}$) we drop
from our set all those numbers which fall in any one of certain $\omega(p)$
of the residue classes mod $p$, and denote the set of remaining integers
by $\mathcal{Y}$. Our object is to obtain an upper bound for $S := |\mathcal{Y}|$.

If $\omega(p)$ is (absolutely) bounded, the sieve methods of Brun and
Selberg yield satisfactory results. On the other hand, these methods fail if,
for instance, $\omega(p)$ is an increasing function of $p$. This was the reason,
as has already been stated in our remark preceding (0.26), that Linnik
called his method the large sieve. In this context one might ask whether
it is not possible to have a version of the large sieve which shall include
the Selberg sieve, say when $\omega(p)$ is bounded.

This problem has not yet been solved. However as we shall see, it 
is possible to adapt the large sieve for this purpose provided we confine 
ourselves to the 'linear sieve' and aim merely for the simplest Selberg 
upper bound. 

In connection with this problem, we recall that in the original form
(of Linnik's method) there was the defect of having the summation
restricted to primes only (cf. (0.54)), which existed up to the first paper
of Gallagher [1]. Bombieri was the first to notice (cf. Bombieri and
Davenport [2]) that one can solve this problem if the dimension $k$ (cf.
(11.3)) of the sieve problem in question equals one (i.e., $\omega(p) = 1$ on
the average). In the general case, Montgomery [1] obtained this result
first, by discovering the identity (in our notation of Chapter 



and thereby extending (0.53) to composite numbers.

However, there is a simpler way of dealing with this problem which
does not make use of (7.1) but starts instead with the well-known
formula (1.34). This was first found for $k = 1$ only (cf. Richert [2]).
Later Huxley [5] (see also Montgomery [5] (Chapter [3Jl, and Huxley [7] 
(Chapter 01 succeeded in extending this method to the general case (cf. 
( 17.241 ). In both the cases the question is reduced to an application of the 
large sieve in its version of Chapter |2 Therefore, the best known solu- 
tion so far, for our sieve problem, is due to Montgomery and Vaughan 
Ell (regarding ( I7.6t . see also Gallagher (6j), who derived this from their 
strong result given in Chapter 

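Before stating the main result of this chapter we sketch now the
aforementioned simpler approach in the case $k = 1$: Let $\mathcal{Q}$ denote the
set of all natural numbers composed only of primes $p$ in our set $\mathcal{P}$ and let
us drop (as in the Selberg sieve) from the set of integers in $(M, M + N]$ those
divisible by some $p \in \mathcal{P}$ to obtain our $\mathcal{Y}$ ($\omega(p) = 1$). (All other notation
is as usual.) By (1.34), we have (since $(n, q) = 1$), for $n \in \mathcal{Y}$ and $q \in \mathcal{Q}$,

$$\mu(q) = \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} e\Big(n\frac{\ell}{q}\Big). \tag{7.2}$$

Summing this over all $n \in \mathcal{Y}$ and interchanging summation we obtain
(on squaring both sides, and with $T$ as defined in (7.20))

$$S^2\mu^2(q) = \Big|\sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} T\Big(\frac{\ell}{q}\Big)\Big|^2, \tag{7.3}$$

which gives, by Cauchy's inequality and summing (after a division by
$\varphi(q)$) over all $q \in \mathcal{Q}$, $q \le Q$,

$$S^2 \sum_{\substack{q \le Q\\ q \in \mathcal{Q}}} \frac{\mu^2(q)}{\varphi(q)} \le \sum_{q \le Q}\ \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Big|T\Big(\frac{\ell}{q}\Big)\Big|^2. \tag{7.4}$$

Now an application of (2.90) gives an upper bound of the desired
form for $S$:

$$S \le \frac{N + Q^2}{\displaystyle\sum_{\substack{q \le Q\\ q \in \mathcal{Q}}} \frac{\mu^2(q)}{\varphi(q)}}. \tag{7.5}$$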

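Now we come to the main result of this chapter. We state it as

Theorem 7.1. Let $\mathcal{Y} \subset (M, M+N]$ be a set of $S$ integers. For each prime
$p$ let us denote by $\omega(p)$ the number of residue classes mod $p$ which do not
contain any number from $\mathcal{Y}$. Then, for any $z > 0$, we have

$$S \le \frac{N + z^2}{L(z)}, \tag{7.6}$$

where

$$L(z) := \sum_{q \le z} \mu^2(q) \prod_{p|q} \frac{\omega(p)}{p - \omega(p)}, \tag{7.7}$$

and also

$$S \le \frac{1}{L^*(z)}, \tag{7.8}$$

with

$$L^*(z) := \sum_{q \le z} \Big(N + \frac32 qz\Big)^{-1} \mu^2(q) \prod_{p|q} \frac{\omega(p)}{p - \omega(p)}. \tag{7.9}$$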

Remarks. Observe here that the inequalities (7.6), (7.8), and also those
of Theorem 7.2 do not deteriorate if $z$ is replaced by its integral part
(with an obvious interpretation if $0 < z < 1$). Therefore, we set $Q = [z]$
and it suffices now to prove our results for $Q \ge 1$. Further, these results
remain true if $\omega(p) = p$ for some prime $p$ (since then $S = 0$). Hence we
assume throughout that

$$\omega(p) < p \quad \text{for all } p \tag{7.10}$$

holds.

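Proof. Our first objective is to prove (7.24), an inequality which holds
trivially if $\omega(p) = 0$ for some $p|q$ or if $\mu(q) = 0$, and also for $q = 1$. So we
can impose the conditions that

$$1 < q\ (\le Q) \text{ is a squarefree number}, \tag{7.11}$$

as well as (cf. (7.10))

$$0 < \omega(p) < p \quad \forall\, p|q. \tag{7.12}$$

Now, for each $p|q$ we have (for certain $h_i = h_i(p)$)

$$n \not\equiv h_i \bmod p, \quad i = 1, \ldots, \omega(p), \quad \forall\, n \in \mathcal{Y}, \tag{7.13}$$

which restrictions are equivalent, by the Chinese remainder theorem and
(7.11), to (with certain $f_j = f_j(q)$)

$$(n - f_j, q) = 1, \quad j = 1, \ldots, \omega(q), \quad \forall\, n \in \mathcal{Y}, \tag{7.14}$$

with

$$\omega(q) = \prod_{p|q} \omega(p), \tag{7.15}$$

so that $\omega(q)$ becomes a multiplicative function ($\ne 0$, because of (7.12))
on setting

$$\omega(1) = 1; \tag{7.16}$$

i.e., we also have

$$\omega \in \mathfrak{M}. \tag{7.17}$$

Next, in view of (7.14), we have, by (1.34),

$$\mu(q) = \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} e\Big((n - f_j)\frac{\ell}{q}\Big) \quad \forall\, n \in \mathcal{Y} \text{ and } j = 1, \ldots, \omega(q). \tag{7.18}$$

Here summing over all $n \in \mathcal{Y}$ and also over all $j$, followed by squaring
both sides, results in, by Cauchy's inequality,

$$S^2\mu^2(q)\,\omega^2(q) = \Big|\sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q}\ \sum_{n \in \mathcal{Y}} e\Big(n\frac{\ell}{q}\Big) \sum_{j=1}^{\omega(q)} e\Big(-f_j\frac{\ell}{q}\Big)\Big|^2 \le \Big(\sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Big|T\Big(\frac{\ell}{q}\Big)\Big|^2\Big)\Big(\sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Big|\sum_{j=1}^{\omega(q)} e\Big(-f_j\frac{\ell}{q}\Big)\Big|^2\Big), \tag{7.19}$$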



where we have employed as before (cf. il.3l ) our usual definition (cf. 

urn ) 

$$T(x) := \sum_{n \in \mathcal{Y}} e(nx). \tag{7.20}$$

Denoting the second factor in the last expression of (7.19) by $\Sigma_0(q)$
and taking the summation inside, we see that one has, on using (1.29)
and (1.32),

$$\Sigma_0(q) = \sum_{j,j'=1}^{\omega(q)} c_q(f_{j'} - f_j) = \sum_{j,j'=1}^{\omega(q)}\ \sum_{\substack{d|q\\ d|f_{j'} - f_j}} d\,\mu\Big(\frac{q}{d}\Big) = \sum_{d|q} d\,\mu\Big(\frac{q}{d}\Big) \sum_{b=1}^{d} \Big(\sum_{\substack{j=1\\ f_j \equiv b \bmod d}}^{\omega(q)} 1\Big)^2. \tag{7.21}$$

Note that here the $b$'s for which the corresponding inner sum is not
empty are precisely those $\omega(d)$ forbidden residue classes mod $d$,
because of (7.14). Further, for each such $b$ the inner sum counts the same
number of $f_j$'s, namely $\omega(q/d)$. Hence, from (7.21),

$$\Sigma_0(q) = \sum_{d|q} d\,\mu\Big(\frac{q}{d}\Big)\,\omega(d)\,\omega^2\Big(\frac{q}{d}\Big). \tag{7.22}$$

Now taking $f_1(n) = n\omega(n)$, $f_2(n) = \mu(n)\omega^2(n)$ in (1.18) it follows,
because of (7.17), (7.11) and (7.12), that

$$\Sigma_0(q) = \prod_{p|q} \big(p\,\omega(p) - \omega^2(p)\big) = \prod_{p|q} \{\omega(p)(p - \omega(p))\}. \tag{7.23}$$

Using (7.23) in (7.19) and noting (7.15) we obtain

$$S^2\mu^2(q) \prod_{p|q} \frac{\omega(p)}{p - \omega(p)} \le \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Big|T\Big(\frac{\ell}{q}\Big)\Big|^2. \tag{7.24}$$

This is the basic inequality providing the connection between our
sieve problem for $\mathcal{Y}$ and the large sieve method. In view of the remarks at
the beginning of the proof, (7.24) is valid for all $q \in \mathbb{N}$.
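
The evaluation (7.23) of the second factor in (7.19) can be confirmed numerically for small squarefree $q$. In the sketch below (illustrative only; `crt_classes`, `sigma0` and `expected_product` are names of our own) the forbidden classes are prescribed for each prime $p|q$, the $f_j$ are built by brute-force use of the Chinese remainder theorem, and the sum over reduced $\ell$ of $|\sum_j e(-f_j \ell/q)|^2$ is compared with $\prod_{p|q} \omega(p)(p - \omega(p))$.

```python
import cmath
from itertools import product
from math import gcd


def e(x):
    """e(x) = exp(2 * pi * i * x)."""
    return cmath.exp(2j * cmath.pi * x)


def crt_classes(forbidden):
    """forbidden maps each prime p | q to its forbidden residues (0 <= h < p).

    Returns q and the list of all f mod q lying in a forbidden class mod
    every p | q, as in the passage from (7.13) to (7.14).
    """
    primes = list(forbidden)
    q = 1
    for p in primes:
        q *= p
    fs = []
    for choice in product(*(forbidden[p] for p in primes)):
        fs.append(next(a for a in range(q)
                       if all(a % p == h for p, h in zip(primes, choice))))
    return q, fs


def sigma0(q, fs):
    """Sum over reduced l mod q of |sum_j e(-f_j * l / q)|^2."""
    return sum(abs(sum(e(-f * l / q) for f in fs)) ** 2
               for l in range(1, q + 1) if gcd(l, q) == 1)


def expected_product(forbidden):
    """Right-hand side of (7.23)."""
    result = 1
    for p, classes in forbidden.items():
        w = len(classes)
        result *= w * (p - w)
    return result


if __name__ == "__main__":
    forbidden = {3: [0], 5: [1, 2]}
    q, fs = crt_classes(forbidden)
    print(q, fs, sigma0(q, fs), expected_product(forbidden))
```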

Finally, summing (7.24) over all $q \le Q$ it follows, by (7.7) and
(2.90), that

$$S^2 L(Q) \le \sum_{q \le Q}\ \sum_{\substack{\ell=1\\ (\ell,q)=1}}^{q} \Big|T\Big(\frac{\ell}{q}\Big)\Big|^2 \le (N + Q^2) \sum_{n \in \mathcal{Y}} 1 = (N + Q^2)S. \tag{7.25}$$

This proves (7.6), on recalling our earlier remark preceding (7.10).

Further, (7.8) is proved in the same manner by multiplying (7.24) by
$(N + \frac32 qQ)^{-1}$ before summation and then using (2.91). Thus Theorem
7.1 is completely proved. □

The following seemingly more general result is easily derived from
Theorem 7.1.

Theorem 7.2. Under the assumptions of Theorem 7.1 let $a_n$ be arbitrary
complex numbers satisfying

$$a_n = 0 \quad \forall\, n \notin \mathcal{Y}. \tag{7.26}$$

Then, for any $z > 0$, we have

$$\Big|\sum_{M < n \le M+N} a_n\Big|^2 \le \frac{N + z^2}{L(z)} \sum_{M < n \le M+N} |a_n|^2 \tag{7.27}$$

and

$$\Big|\sum_{M < n \le M+N} a_n\Big|^2 \le \frac{1}{L^*(z)} \sum_{M < n \le M+N} |a_n|^2. \tag{7.28}$$

Proof. It follows from (7.26) that, by Cauchy's inequality, the left-hand
side of (7.27) (and so also of (7.28)) is

$$\le \Big(\sum_{n \in \mathcal{Y}} 1\Big)\Big(\sum_{M < n \le M+N} |a_n|^2\Big) = S \sum_{M < n \le M+N} |a_n|^2, \tag{7.29}$$

from which our results are readily obtained from Theorem 7.1. □

Remark. One might like to consider Theorem 7.2 as a weighted form
of Theorem 7.1, but the restriction (7.26) and the relation (7.29) show
that Theorem 7.1 is never weaker than Theorem 7.2.



NOTES 

(7.1): Montgomery's [1] proof of (7.1) proceeds in the following
way: Set

T(q,h):= V 7\-M-/j-) 

so that, by J 1.27b . one has 

q q i 

Y,\T{qM 2 = q^'\T(-)\ 2 . (7-31) 

h=\ [=\ 1 

Now, using STM . (fT32l and dO) . 

q t 

t{q,h) = Y,Ta e{{n ~ h) ~ ) = Z Z d ^2 ] = 2 dnfys(d,h\ 

ney h=l ^ ney d\q d\q 

d\n-h 

(7.32) 

from which d7.il) is derived by means of dl.30t and d7.3U . 

For identities of this type see (II .27b . Montgomery Q, Huxley I71l 
(Chapter 18), and Sokolovskij fll . 

Theorem 7.1: This result contains the (B)-version of the large sieve,
as well as, in the cases mentioned in the introduction of this chapter,
the 'small' sieves. On the other hand, as per an observation made by
Kobayashi [1], one can also derive Theorem 7.1 and consequently also
Theorem 7.2 and Theorem 8.1 in these cases from the Selberg sieve
with the additional tool of Theorem 2.6 (see Halberstam and Richert
(pp. 125-126)).

In the case of $\omega(p)$ being close to $p$ (at least on the average), the
'larger sieve' of Gallagher [3] is more effective. This sieve also includes
prime-power moduli.



in the following 
(7.30) 



(7.6): Johnsen [1] has generalized Montgomery's [1] first result of
this kind to include non-squarefree numbers also, by reducing the
question to an inequality of the type of (7.24), so that the improved version
(2.90) leads to the following:

'For each $p$ remove all but $g(p)$ residue classes mod $p$. In each
of the remaining classes, remove all but $g(p^2)$ different residue classes
mod $p^2$, and so on. Then the number of $n \le N$ which remain is at most
$(N + z^2)/L(z)$, for every $z > 0$, where



with $h(p^{\nu}) = g(p)g(p^2) \cdots g(p^{\nu})$ being the number of residue classes
mod $p^{\nu}$ remaining at the $\nu$th stage'.

A simpler proof of this result has been given by Gallagher __ (cf. 
Gallagher §_). 

(17 .22l i : For the remark that precedes (I7.22t note that otherwise, by 
d7.15t and (I7.11t . there will be one forbidden bmodd with more than 

= co{-) of distinct f's = bmodd and hence for any «(e y) and 
u){d) d ' 

one such we would have (n - fv — ) > 1 contrary to (I7.14I) . 
_____ J d _____ 

(I7.24t : For a variant of the proof of ( 17.241 see Montgomery __ 
(Chapter _J, and Bombieri [6] (p. 21). 




,v-i 



(7.33) 



Chapter 8 



The Brun-Titchmarsh 
Theorem 



IN THIS chapter we deal with an important application, in prime number theory, of the arithmetical sieve result of the previous chapter. One of the prominent problems of number theory is the study of the distribution of primes in arithmetic progressions; i.e., to investigate

\[ \pi(x; k, \ell) := \sum_{\substack{p \le x \\ p \equiv \ell \bmod k}} 1, \tag{8.1} \]

in particular, to obtain estimates valid uniformly in (ranges of) $k$ (relative to $x$).

In this direction there is the famous Siegel-Walfisz theorem that, for any $C > 0$ and $U > 0$,

\[ \pi(x; k, \ell) = \frac{\operatorname{li} x}{\varphi(k)} + O_{U,C}(x \log^{-U} x) \quad \text{uniformly in } k \le \log^{C} x \tag{8.2} \]

(with an ineffective $O$-constant due to the possible existence of a Siegel-zero), which was one of the main tools in I.M. Vinogradov's proof of the solubility of the famous equation

\[ 2N + 1 = p_1 + p_2 + p_3 \quad \text{for } N \ge N_0. \tag{8.3} \]
Titchmarsh used the generalized Riemann hypothesis to tackle the divisor problem (since then named after him)

\[ \sum_{p \le x} d(p-1) \sim cx \quad \text{as } x \to \infty \ (\text{certain } c > 0) \tag{8.4} \]

where he also employed (cf. (8.21)) the estimate

\[ \pi(x; k, \ell) \ll_a \frac{x}{\varphi(k) \log x} \quad \text{for } k \le x^{a} \ (\text{fixed } a,\ 0 < a < 1), \tag{8.5} \]

which he obtained by Brun's sieve.

This problem provides a good example to illustrate our earlier remark that Bombieri's prime number theorem can replace the generalized Riemann hypothesis on average, since (6.23) states that $\frac{\operatorname{li} x}{\varphi(k)}$ is the leading term for almost all $k \le \frac{x^{1/2}}{\log^{C} x}$.

Now we are in a position to outline the unconditional proof of (8.4) employing (6.23) and (8.5). To start with, we have

\[ \sum_{p \le x} d(p-1) = \sum_{p \le x} \sum_{d \mid p-1} 1 = \sum_{d \le x} \sum_{\substack{p \le x \\ p \equiv 1 \bmod d}} 1 = \sum_{d \le x} \pi(x; d, 1) \tag{8.6} \]

(in the notation of (8.1)). Let $U > 0$ denote a number to be suitably restricted and let $C = C(U)\ (> 1)$ be a value for which (6.23) holds. Then splitting the range of $d$ in the last sum of (8.6) into three parts

\[ d \le x^{1/2} \log^{-C} x, \qquad x^{1/2} \log^{-C} x < d \le x^{1/2} \log^{C} x, \qquad x^{1/2} \log^{C} x < d \le x \tag{8.7} \]

and denoting the corresponding partial sums there by

\[ \sum_1, \quad \sum_2, \quad \sum_3 \tag{8.8} \]

we see that, in view of (8.5), one has
where $''$ denotes the restriction of $d$ to the second range in (8.7). Here we have used the fact (cf. Estermann [1]) that

\[ \sum_{d \le y} \frac{1}{\varphi(d)} = c \log y + O(1), \qquad c := \prod_p \Big(1 + \frac{1}{p(p-1)}\Big). \tag{8.10} \]

Next consider (in a notation similar to the one above)

\[ \sum_3 = \sum_{d}{}^{'''} \sum_{\substack{p \le x \\ p \equiv 1 \bmod d}} 1 = \sum_{p \le x} \sum_{d \mid p-1}{}^{'''}\, 1. \tag{8.11} \]



Treating J)i similarly we see that (after a simple rearrangement) 
since C > 1, 

Z ^-i)^ + * log_1 * (8 - 12) 

1 3 p<x d\p-\ p<x\og- lc x 2 



from which one obtains, with the help of (8.9) and (8.6),

\[ \sum_{p \le x} d(p-1) = 2 \sum_1 + O\Big(\frac{x \log\log x}{\log x}\Big). \tag{8.13} \]



Now, by our choice of C and d6.23t . we have 

' // x ' 1 ' 1 

2 = 2 * d - ^ + " * I ^ + B * Z ^ + 0(x ^ *)■ 

(8.14) 



Using here (8.10) again we get, in view of (8.13),

\[ \sum_{p \le x} d(p-1) = 2 \operatorname{li} x \, \big(c \log(x^{1/2} \log^{-C} x)\big) + O\Big(\frac{x \log\log x}{\log x}\Big) \tag{8.15} \]

on taking $U = 2$ say. Thus, after inserting the value of $c$ from (8.10), we have the asymptotic formula (8.4) in a more precise form:

\[ \sum_{p \le x} d(p-1) = x \prod_p \Big(1 + \frac{1}{p(p-1)}\Big) + O\Big(\frac{x \log\log x}{\log x}\Big). \tag{8.16} \]
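The rearrangement (8.6) underlying this argument is an exact identity and can be checked by brute force. The sketch below does so, and also evaluates a truncation of the Euler product for the constant in (8.16); the function names and the truncation bound are assumptions made for illustration.

```python
from math import isqrt


def primes_up_to(n):
    """Return all primes p <= n by the sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = bytearray([1]) * (n + 1)
    is_prime[0] = is_prime[1] = 0
    for i in range(2, isqrt(n) + 1):
        if is_prime[i]:
            is_prime[i * i::i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if is_prime[i]]


def divisor_count(m):
    """Number of positive divisors of m >= 1."""
    count = 0
    for d in range(1, isqrt(m) + 1):
        if m % d == 0:
            count += 1 if d * d == m else 2
    return count


def divisor_sum_over_primes(x):
    """Left-hand side of (8.6): sum of d(p - 1) over primes p <= x."""
    return sum(divisor_count(p - 1) for p in primes_up_to(x))


def progression_count_sum(x):
    """Right-hand side of (8.6): sum over d <= x of pi(x; d, 1)."""
    primes = primes_up_to(x)
    return sum(sum(1 for p in primes if (p - 1) % d == 0)
               for d in range(1, x + 1))


def titchmarsh_constant(prime_bound=10**5):
    """Euler product of (1 + 1/(p(p-1))), truncated at prime_bound."""
    c = 1.0
    for p in primes_up_to(prime_bound):
        c *= 1.0 + 1.0 / (p * (p - 1))
    return c


if __name__ == "__main__":
    x = 2000
    print(divisor_sum_over_primes(x), progression_count_sum(x))
    print(titchmarsh_constant())
```

The ratio of the divisor sum to x approaches the constant only slowly, in line with the error term of relative size (log log x)/(log x) in (8.16).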
For number-theoretic purposes the 'Brun-Titchmarsh Theorem' (8.5) is very valuable, being the only known result valid in such a wide range for $k$.

In the direction of (8.5) we shall prove the following result of Montgomery and Vaughan [2] as an application of Theorem 7.1 ((7.8)):

Theorem 8.1. For any positive real numbers $x$ and $y$, and for any $\ell, k$ from $\mathbb{N}$ with

\[ (\ell, k) = 1, \tag{8.17} \]

there is an absolute constant $c_0$ such that

\[ \pi(x+y; k, \ell) - \pi(x; k, \ell) < \frac{2y}{\varphi(k)\big(\log\frac{y}{k} + \frac{13}{15}\big)}, \tag{8.18} \]

provided

\[ \frac{y}{k} > c_0. \tag{8.19} \]
k 

Remark. Under the assumptions of Theorem 8.1 we have, in particular, the estimate (8.18) without the term $\frac{13}{15}$. In this context, by adding some numerical computation, Montgomery and Vaughan have also shown that $c_0$ can be taken equal to 1, i.e., we have the neat result

\[ \pi(x+y; k, \ell) - \pi(x; k, \ell) < \frac{2y}{\varphi(k)\log\frac{y}{k}}, \quad 1 \le \ell \le k < y,\ (\ell,k) = 1,\ \forall x > 0; \tag{8.20} \]

so, in particular, choosing $x = 0$ (and replacing $y$ by $x$)

\[ \pi(x; k, \ell) < \frac{2x}{\varphi(k)\log\frac{x}{k}}, \quad 1 \le \ell \le k < x,\ (\ell,k) = 1. \tag{8.21} \]
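As an illustration of how (8.21) is read, the following sketch compares the exact count of primes in each reduced residue class with the stated bound, for every modulus below a small x. It is a brute-force check under the assumption that x is a small positive integer; the helper names are not from the notes.

```python
from math import gcd, isqrt, log


def primes_up_to(n):
    """Return all primes p <= n by the sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = bytearray([1]) * (n + 1)
    is_prime[0] = is_prime[1] = 0
    for i in range(2, isqrt(n) + 1):
        if is_prime[i]:
            is_prime[i * i::i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if is_prime[i]]


def euler_phi(k):
    """Euler's function, computed directly from the definition."""
    return sum(1 for a in range(1, k + 1) if gcd(a, k) == 1)


def brun_titchmarsh_holds(x):
    """Check pi(x; k, l) < 2x / (phi(k) log(x/k)) for all 1 <= l <= k < x."""
    primes = primes_up_to(x)
    for k in range(1, x):
        counts = [0] * k
        for p in primes:
            counts[p % k] += 1          # class l = k is stored at index 0
        bound = 2 * x / (euler_phi(k) * log(x / k))
        for l in range(1, k + 1):
            if gcd(l, k) == 1 and not counts[l % k] < bound:
                return False
    return True


if __name__ == "__main__":
    print(brun_titchmarsh_holds(500))
```

The point of the exercise is uniformity: the same inequality is tested for every k up to x - 1, including moduli so close to x that no asymptotic formula is available.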

Proof. Let $z > 0$ denote a number to be suitably chosen later. Consider the set

\[ \mathcal{N} := \Big\{ m : x < mk + \ell \le x + y,\ \Big(mk + \ell, \prod_{\substack{p \le z \\ p \nmid k}} p\Big) = 1 \Big\}. \tag{8.22} \]
In the notation of Theorem 7.1 we have

\[ M = \Big[\frac{x - \ell}{k}\Big], \qquad N = \Big[\frac{x + y - \ell}{k}\Big] - \Big[\frac{x - \ell}{k}\Big], \tag{8.23} \]

and so

\[ \frac{y}{k} - 1 < N < \frac{y}{k} + 1. \tag{8.24} \]

Note that $\mathcal{N}$ contains all those prime numbers counted in $\pi(x+y; k, \ell) - \pi(x; k, \ell)$ which are $> z$, and also

\[ \omega(p) > 0 \quad \forall p \le z,\ p \nmid k. \tag{8.25} \]



Hence, by < !7.8b . 



n(x + y;k,€)-n(x;k,t)<S +z< 7 —+< 1 -^-- + z, (8.26) 



where, by ( I8.25I ). 



1<z 
(<?,*)=! 



We now choose 



99 



so that 



which makes 



z = (3AO 5 



2 z 



3N 



M k (z)= ( 1+ 9 Z_1 )' 



(?,*)=! 



<p(q) ' 



(8.28) 
(8.29) 
(8.30) 



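The equality part of (3.26) and the observation that $(1 + qz^{-1})^{-1}$ decreases as $q$ increases, enable us to uphold

\[ M_k(z) \ge \frac{\varphi(k)}{k} \sum_{q \le z} (1 + qz^{-1})^{-1} \frac{\mu^2(q)}{\varphi(q)} = \frac{\varphi(k)}{k} M_1(z). \tag{8.31} \]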






Now we need, instead of the estimate in (3.26) which yielded only the leading term, the more precise formula due to D.R. Ward [1], namely that

\[ \sum_{q \le w} \frac{\mu^2(q)}{\varphi(q)} = \log w + c_1 + o(1) \quad \text{as } w \to \infty, \tag{8.32} \]

where

\[ c_1 := \sum_p \frac{\log p}{p(p-1)} - \lim_{w \to \infty}\Big(\log w - \sum_{n \le w} \frac{1}{n}\Big) = 1.33258\cdots \tag{8.33} \]

(cf. Rosser and Schoenfeld [1]). Using (8.32) for partial summation we get, as $N \to \infty$,

\[ M_1(z) = \log z + c_1 - \log 2 + o(1) = \frac{1}{2}\log N + c_1 - \log 2 - \frac{1}{2}\log\frac{3}{2} + o(1) \tag{8.34} \]

because of (18.28b . and consequently 

^_!-Mi(z) - X - log(tf + 1) + c 2 + o(l) as # -» oo (8.35) 
with, in view of d8.33l >. 

\[ c_2 = c_1 - \log 2 - \frac{1}{2}\log\frac{3}{2} = c_1 - \frac{1}{2}\log 6 > \frac{13}{30}. \tag{8.36} \]
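The numerical claims in (8.33) and (8.36) are easy to reproduce. The sketch below assumes the standard value of Euler's constant for the limit in (8.33) and truncates the prime sum at a bound chosen for illustration, so the computed value of $c_1$ is slightly too small (by about the reciprocal of the bound).

```python
from math import isqrt, log

EULER_GAMMA = 0.5772156649015329  # lim (sum_{n<=w} 1/n - log w)


def primes_up_to(n):
    """Return all primes p <= n by the sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = bytearray([1]) * (n + 1)
    is_prime[0] = is_prime[1] = 0
    for i in range(2, isqrt(n) + 1):
        if is_prime[i]:
            is_prime[i * i::i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if is_prime[i]]


def ward_constant(prime_bound=10**5):
    """c_1 of (8.33), with the sum over primes truncated at prime_bound."""
    prime_sum = sum(log(p) / (p * (p - 1)) for p in primes_up_to(prime_bound))
    return EULER_GAMMA + prime_sum


c1 = ward_constant()
c2 = c1 - 0.5 * log(6)  # c_2 of (8.36)

if __name__ == "__main__":
    print(f"c1 = {c1:.5f}, c2 = {c2:.5f}, 13/30 = {13 / 30:.5f}")
```

The margin between $c_2$ and 13/30 is only a few thousandths, which is why the constant 13/15 in (8.18) cannot be pushed much further by this choice of z.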



We obtain, on using ( 18.321 ). ( 18.35b and ( 18.28b . 

N - 1 w(k) i 

N 2k 

(8.37) 

Thus, on choosing a sufficiently large Co, it follows from an use of 
dOTt indOoTi 



n(x + y; k, I) - n(x; k, I) < 



2k(N - 1) 



<p(k)(\og(N + 1) + 2c 2 + o(l)) 
2y 



<p(Wog(l) + jf ) 



+ (N- 1)-- 
(8.38) 



because of (8.24) and (8.19). This completes the proof of the theorem.
NOTES

For the history of Brun-Titchmarsh theorem we refer to Halberstam 
and Richert Q (Chapter©. 

(18- 12l>: The estimations of (18. 12l> are obtained as follows. We have 



z-z=z Z'-Z 



Vd\p-1 



d\p-\ 



(8.39) 



For any fixed p < x, the difference of the sums inside is precisely the 



number of d's(< xi log c x) dividing p-1 and with 



p-1 



< X2 log c X. 



Now splitting such d's into parts according as 



1 



< log c x or not 



and noting further further that the second part is empty if p > log" x 
we obtain the first majorization. The remaining part employs only the 
simple X d(m) = Oiylogy). 



This unconditional result is due to Rodriquez and Halberstam [1] (cf. Halberstam and Richert [1] (Theorem 3.9)). More general results of this type have already been mentioned in the notes of Chapter 6.

Theorem 8.1: The estimate (8.20) demonstrates the power of the weighted sieve of Montgomery and Vaughan (cf. under Theorem 7.1 in the notes for Chapter 7). However, an estimate of the type (8.18) can also be derived without the use of Kobayashi's results: cf. Halberstam and Richert [1] (pp. 124-125). On the other hand, it is easily checked
that if we use, at the beginning of the proof, the estimate MM instead 
of (T7.8b we cannot obtain (18.20b . even subject to the condition ( 18. 19b . 
without an extra term on the right-hand side. 



A further improvement of the factor 2 in (8.21) to a constant $c < 2$ would have important consequences concerning the Siegel-zeros of Dirichlet's $L$-functions, as was first pointed out by Rodosskij (cf., for example, Bombieri and Davenport [2]).



It is easily checked that to have, instead of (8.36),

\[ 2c_2 \ge 1, \tag{8.40} \]

so that one has the neater (8.18) with 1 in place of $\frac{13}{15}$, it would be sufficient to improve the constant $\frac{3}{2}$ (at least in (2.91)) (cf. notes for Chapter 2 under (2.76)) to a constant $A$ satisfying

\[ A \le \frac{1}{4} e^{2c_1 - 1} = 1.32163\cdots \tag{8.41} \]

On the other hand (see the above remark) we point out that Selberg's sieve permits one to replace 13/15 in (8.18) by any constant $C$ (with a $c_0 = c_0(C)$ in (8.19)).

Recently, starting from the works of Hooley and Motohashi (Hoo- 
ley 0, IH, and Motohashi [8], [9], [11]) there has been a remarkable 
progress with respect to the Brun-Titchmarsh theorem. Some of these 
results are concerning averages and certain others are valid only for 
some ranges of k. As an example, we mention one of the most recent 
results of the latter type (Goldfeld [4]): For every (sufficiently small) 
e > holds, with a certain c > 0, 

n(x; k, < (1 + e)— — ^— =- ).ifxi- C£ <k<x± (8.42) 
^)log(^= 



Chapter 9 

Selberg's Sieve 



NOW WE turn to the small sieves. The most elegant version of a small sieve is due to Selberg. In this chapter we present its simplest version with a view to clarifying the main ideas involved. Also, in the next chapter we shall see its importance in the proof of the remarkable theorem of Chen.

This sieve method can be considered as concerning the question of finding bounds for



the number of elements in a (finite) sequence, depending on several pa- 
rameters, 



of (not necessarily distinct and not necessarily positive) integers, which 
are not divisible by any prime number < z, 



(9.1) 



sf := {a : ■ ■ ■ },a e Z. 



(9.2) 



z > (z € R), 



(9.3) 



belonging to a set of primes 



f :={/?:..•} 



(9.4) 



Introducing 




(9.5) 



pep 
we can restate this question as that of estimating

\[ S(\mathcal{A}, \mathfrak{P}, z) = |\{a : a \in \mathcal{A},\ (a, P(z)) = 1\}|. \tag{9.6} \]

The required estimates would be naturally dependent on the various parameters describing $\mathcal{A}$, $\mathfrak{P}$, and also on $z$. However, we would be interested in bounds for (9.6) which do not involve the special features of these defining arguments ($\mathcal{A}$ and $\mathfrak{P}$). To make this remark clearer, we introduce the following notation.

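Let

\[ \mathcal{A}_d := \{a : a \in \mathcal{A},\ a \equiv 0 \bmod d\} \quad \text{for } d \in \mathbb{N}. \tag{9.7} \]

First we choose a convenient approximation $X$ to $|\mathcal{A}|$, requiring

\[ X > 1. \tag{9.8} \]

We write for the remainder

\[ R_1 := |\mathcal{A}| - X. \tag{9.9} \]

Next, for each prime $p \in \mathfrak{P}$, we choose $\omega(p)\ (\in \mathbb{R})$ so that $\frac{\omega(p)}{p} X$ is close to $|\mathcal{A}_p|$, and set

\[ R_p := |\mathcal{A}_p| - \frac{\omega(p)}{p} X, \quad \forall p \in \mathfrak{P}. \tag{9.10} \]

Further, denoting by

\[ \overline{\mathfrak{P}} \tag{9.11} \]

the complement of $\mathfrak{P}$ with respect to the set of all primes, we also put

\[ \omega(p) = 0 \quad \forall p \in \overline{\mathfrak{P}}. \tag{9.12} \]

(This is consistent with our interest being only with regard to the distribution of numbers of $\mathcal{A}$ in the residue classes modulo primes from $\mathfrak{P}$.) If we now define

\[ \omega(d) := \prod_{p \mid d} \omega(p) \quad \forall d \in \mathbb{N} \text{ with } \mu(d) \ne 0, \qquad \omega(1) := 1 \tag{9.13} \]

(and $\omega(d) = 0$ if $\mu(d) = 0$), we see that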



(9.14) 
and also that the definition

\[ R_d := |\mathcal{A}_d| - \frac{\omega(d)}{d} X \quad \forall d \in \mathbb{N} \text{ with } \mu(d) \ne 0, \tag{9.15} \]

is consistent with (9.9) and (9.10). (Note that $\omega$ may depend on both $\mathcal{A}$ and $\mathfrak{P}$.) Now we can elaborate a little on our remarks made subsequent to (9.6). The estimates for (9.6) are allowed to depend on $X$, $\omega$ (and consequently on $R$), but not on the particular structures of $\mathcal{A}$ and $\mathfrak{P}$ (apart from those which yield information towards the most appropriate choices for $X$ and $\omega$ as introduced above).

For the purposes of the method we also require $\omega$ to fulfil the condition

\[ 0 \le \frac{\omega(p)}{p} \le 1 - \frac{1}{A_1} \quad \text{with some constant } A_1 \ge 1 \tag{9.16a} \]

or equivalently

\[ (\Omega_1) \qquad 1 \le \frac{1}{1 - \frac{\omega(p)}{p}} \le A_1 \quad \text{with some constant } A_1 \ge 1. \tag{9.16b} \]

We introduce further

\[ g(d) := \prod_{p \mid d} \frac{\omega(p)}{p - \omega(p)}, \tag{9.17} \]

which is well-defined (because of $(\Omega_1)$). The product $g(d)$ reminds us of the function occurring in Theorem 7.1, but the advantage here is that $\omega(p)$ need no longer be integer-valued. On the other hand, when compared with Theorem 7.2, the condition $(\Omega_1)$ already prevents us from dealing with 'too large a sieve'.

By (9.12) we see that

\[ g(d) = 0 \quad \text{if } (d, \overline{\mathfrak{P}}) \ne 1, \tag{9.18} \]

and also (from (9.13)) that

\[ g(d) = 0 \iff \omega(d) = 0. \tag{9.19} \]

(Here and in what follows $(d, \overline{\mathfrak{P}}) = 1$ means that no $p \in \overline{\mathfrak{P}}$ divides $d$.) Finally, we put

\[ W(z) := \prod_{p < z} \Big(1 - \frac{\omega(p)}{p}\Big), \tag{9.20} \]

\[ G(z) := \sum_{d < z} \mu^2(d) g(d), \tag{9.21} \]

and more generally

\[ G_k(x) := \sum_{\substack{d < x \\ (d,k)=1}} \mu^2(d) g(d), \quad 0 < x \in \mathbb{R},\ k \in \mathbb{N}. \tag{9.22} \]

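In view of (9.6) we could start with the identity (cf. (1.20))

\[ S(\mathcal{A}, \mathfrak{P}, z) = \sum_{a \in \mathcal{A}} \sum_{d \mid (a, P(z))} \mu(d); \tag{9.23} \]

in fact, this is the sieve formula of Eratosthenes-Legendre. Selberg's sieve, for obtaining an upper bound for $S(\mathcal{A}, \mathfrak{P}, z)$, consists in the introduction of arbitrary real numbers $\lambda_d$ with the only condition

\[ \lambda_1 = 1, \tag{9.24} \]

which already implies that

\[ S(\mathcal{A}, \mathfrak{P}, z) \le \sum_{a \in \mathcal{A}} \Big( \sum_{\substack{d \mid a \\ d \mid P(z)}} \lambda_d \Big)^2 = \sum_{\substack{d_\nu \mid P(z) \\ \nu = 1,2}} \lambda_{d_1} \lambda_{d_2} \sum_{\substack{a \in \mathcal{A} \\ a \equiv 0 \bmod [d_1, d_2]}} 1. \tag{9.25} \]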



Hence, by fTTJ and d9~15l . 

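\[ S(\mathcal{A}, \mathfrak{P}, z) \le X \sum_{\substack{d_\nu \mid P(z) \\ \nu = 1,2}} \lambda_{d_1} \lambda_{d_2} \frac{\omega([d_1, d_2])}{[d_1, d_2]} + \sum_{\substack{d_\nu \mid P(z) \\ \nu = 1,2}} \lambda_{d_1} \lambda_{d_2} R_{[d_1, d_2]} =: X \sum_1 + \sum_2, \tag{9.26} \]

say. With a view to keeping $\sum_2$ small one takes in this method

\[ \lambda_d = 0 \quad \text{for } d \ge z \tag{9.27} \]

and then the remaining $\lambda_d$'s ($2 \le d < z$, $d \mid P(z)$) are chosen so as to minimize $\sum_1$. This leads to the choice

\[ \lambda_d = \mu(d) \prod_{p \mid d} \frac{p}{p - \omega(p)} \cdot \frac{G_d(z/d)}{G(z)}. \tag{9.28} \]

Note here that (9.28) includes both (9.24) and (9.27). Next, it can be shown by the argument of (3.26) that

\[ |\lambda_d| \le 1. \tag{9.29} \]

Now, with the choice (9.28), one obtains

\[ \sum_1 = \frac{1}{G(z)} \tag{9.30} \]

and further (9.27) and (9.29) give

\[ \Big|\sum_2\Big| \le \sum_{\substack{d_\nu < z \\ d_\nu \mid P(z) \\ \nu = 1,2}} |R_{[d_1, d_2]}|. \tag{9.31} \]

Here the numbers $d = [d_1, d_2]$ are $< z^2$ and divide $P(z)$. Since $d$ is squarefree, the number of terms with the same $d$ is at most

\[ |\{d_1, d_2 : [d_1, d_2] = d\}| = 3^{\nu(d)}. \tag{9.32} \]

From (9.26), (9.30) and (9.31) we now obtain (in view of (9.32))

Theorem 9.1. $(\Omega_1)$¹: We have, in the above notation,

\[ S(\mathcal{A}, \mathfrak{P}, z) \le \frac{X}{G(z)} + \sum_{\substack{d_\nu < z \\ d_\nu \mid P(z) \\ \nu = 1,2}} |R_{[d_1, d_2]}| \le \frac{X}{G(z)} + \sum_{\substack{d < z^2 \\ d \mid P(z)}} 3^{\nu(d)} |R_d| \le \frac{X}{G(z)} + \sum_{\substack{d < z^2 \\ (d, \overline{\mathfrak{P}}) = 1}} \mu^2(d) 3^{\nu(d)} |R_d|. \tag{9.33} \]

¹By this notation, which is also employed in a similar way later, we mean that the subsequent statement is valid subject to the conditions in parentheses.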
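The choice (9.28) and its consequences (9.24), (9.29) and (9.30) can be verified in exact rational arithmetic in the simplest case. The sketch below assumes $\omega(p) = 1$ for every prime (so that $g(d) = 1/\varphi(d)$ on squarefree $d$ and every squarefree $d < z$ divides $P(z)$); all function names are illustrative.

```python
from fractions import Fraction
from math import gcd


def prime_factors(n):
    """Distinct prime factors of n >= 1, by trial division."""
    factors = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            factors.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors


def is_squarefree(n):
    return all(n % (p * p) != 0 for p in prime_factors(n))


def g(d):
    """g(d) of (9.17) under the assumption omega(p) = 1 for all p."""
    value = Fraction(1)
    for p in prime_factors(d):
        value *= Fraction(1, p - 1)
    return value


def G(x, k=1):
    """G_k(x) of (9.22): sum of mu^2(d) g(d) over d < x with (d, k) = 1."""
    total = Fraction(0)
    d = 1
    while d < x:
        if is_squarefree(d) and gcd(d, k) == 1:
            total += g(d)
        d += 1
    return total


def selberg_lambda(d, z):
    """lambda_d of (9.28) for squarefree d, again with omega(p) = 1."""
    sign = (-1) ** len(prime_factors(d))      # mu(d) for squarefree d
    density = Fraction(1)
    for p in prime_factors(d):
        density *= Fraction(p, p - 1)         # p / (p - omega(p))
    return sign * density * G(Fraction(z, d), d) / G(z)


def sigma_1(z):
    """The quadratic form Sigma_1 of (9.26), and the weights used."""
    ds = [d for d in range(1, z) if is_squarefree(d)]
    lam = {d: selberg_lambda(d, z) for d in ds}
    total = Fraction(0)
    for d1 in ds:
        for d2 in ds:
            lcm = d1 * d2 // gcd(d1, d2)
            total += lam[d1] * lam[d2] * Fraction(1, lcm)  # omega(lcm) = 1
    return total, lam


if __name__ == "__main__":
    value, weights = sigma_1(30)
    print(value, 1 / G(30), max(abs(w) for w in weights.values()))
```

For z = 4 the weights come out as 1, -4/5, -3/5 and the quadratic form equals 2/5 = 1/G(4), which can also be confirmed by hand.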



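Now we give two important special cases of Theorem 9.1.

Theorem 9.2. Suppose that

\[ \omega(d) = 1 \text{ and } |R_d| \le 1, \quad \text{if } \mu(d) \ne 0 \text{ and } (d, \overline{\mathfrak{P}}) = 1. \tag{9.34} \]

Then

\[ S(\mathcal{A}, \mathfrak{P}, z) \le \frac{X}{\Big(\prod_{\substack{p < z \\ p \in \overline{\mathfrak{P}}}} \big(1 - \frac{1}{p}\big)\Big) \log z} + z^2. \tag{9.35} \]

Proof. From our assumption on $\omega$ in (9.34) it follows that the condition $(\Omega_1)$ is fulfilled and further

\[ G(z) = \sum_{\substack{d < z \\ (d,k)=1}} \frac{\mu^2(d)}{\varphi(d)}, \tag{9.36} \]

where

\[ k := \prod_{\substack{p < z \\ p \in \overline{\mathfrak{P}}}} p. \tag{9.37} \]

Therefore, by (3.26), we have

\[ G(z) \ge \frac{\varphi(k)}{k} \log z = \prod_{\substack{p < z \\ p \in \overline{\mathfrak{P}}}} \Big(1 - \frac{1}{p}\Big) \log z, \tag{9.38} \]

so that the first inequality of (9.33) yields (9.35) (since $|R_d| \le 1$ by (9.34)). □

Let us set

\[ \mathfrak{P}_K := \{p : p \nmid K\}, \quad K \in \mathbb{Z}. \tag{9.39} \]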



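Theorem 9.3. Let $K\ (\ne 0)$ be an even integer and suppose that

\[ \omega(p) = \frac{p}{p-1} \quad \text{for } p \in \mathfrak{P}_K. \tag{9.40} \]

Then, we have

\[ S(\mathcal{A}, \mathfrak{P}_K, z) \le \mathfrak{S}(K) \frac{X}{\log z} \Big(1 + O\Big(\frac{1}{\log z}\Big)\Big) + \sum_{\substack{d < z^2 \\ (d,K)=1}} \mu^2(d) 3^{\nu(d)} |R_d|, \tag{9.41} \]

where

\[ \mathfrak{S}(K) := 2 \prod_{p > 2} \Big(1 - \frac{1}{(p-1)^2}\Big) \prod_{2 < p \mid K} \frac{p-1}{p-2}. \tag{9.42} \]

Proof. We use the last bound given in (9.33). Clearly, we need only show that

\[ \frac{1}{G(z)} \le 2 \prod_{p > 2} \Big(1 - \frac{1}{(p-1)^2}\Big) \prod_{2 < p \mid K} \frac{p-1}{p-2} \cdot \frac{1}{\log z} \Big(1 + O\Big(\frac{1}{\log z}\Big)\Big), \tag{9.43} \]

since $(\Omega_1)$ is satisfied with $A_1 = 2$ in view of $2 \mid K$. We note that

\[ g(p) = \frac{1}{p-2} = \frac{1}{p-1}\Big(1 + \frac{1}{p-2}\Big) = \frac{1}{\varphi(p)}(1 + g(p)) \quad \text{for } p \nmid K, \tag{9.44} \]

and so (cf. (9.17))

\[ g(d) = \frac{1}{\varphi(d)} \sum_{\ell \mid d} \mu^2(\ell) g(\ell) \quad \text{if } \mu(d) \ne 0,\ (d, K) = 1. \tag{9.45} \]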

Then, by (13.261 



°.,K)=\ ' " 1 



(m,lK)=l 



'.,K)=\ 



(9.46) 

z 

on observing that for € > z the lower bound log {-), for the (empty) 



inner sum, is negative, Further, by (I9.44t and dl.lSt . 



1=1 

(l,K)=l 



p 2 (l)g(l) 

(0 



p\K 



1 



P(P - 2) 



(9.47) 



and 



(^,-f)=i (f,/^)=i 

_ v '°gp i n n 4- i \ 

- 2^ p (p _2) l+(p(p-2))-i 11 ^ + W-T)'- 

P fA 



(9.48) 
Using (9.47) and (9.48) in (9.46) we obtain
G(z) > [1(1 - -) [1(1 + — ^-r-){logZ - V ' 0g /? 



(? - 1) 



2 " 



(9.49) 



p\K - tfK ' - p 

which upholds (9.43). Thus the theorem is completely proved. □

NOTES 

Selberg's sieve occurs for the first time in Selberg (cf. Selberg 

0, 0,1a, 0). 

For the content of this chapter we refer the reader to Halberstam and 
109 Richert [ 1 1 (Chapter©. 

( 19.291 ): The details leading to ( I9.29t are the following. In view of 
d9.28t consider (for only squarefree d's) 



di<z/d 
(di ,d)=l 

= f 2 (.d 2 )g(d 2 ))(( Z n\d x )g{dx)). 

d2\d di<z/d 
(d u d)=\ 



(9.50) 



Now multiplying out the last expression and comparing with G(z) 

we obtain ( 19.291 ) 

Theorem 9.1: cf. Halberstam and Richert [1] (Theorem 3.2).

The observation due to Kobayashi, which we have mentioned earlier 

(cf. notes for Chapters and |SJ, consists in noticing 



G(z) Z 4* = Z Z'baM-nl) 



d\F(n) 
d\P(z) 



q<Z 1=1 



(9.51) 



h=\ 



p\q 



(F(h),q)=\ 



and 



-i 



'qJL\ 



(9.52) 



i=i 
and using the duality principle (cf. (2.47)-(2.48)). Here $F$ denotes an integer-valued polynomial and $\rho(p)$ is the number of solutions of $F(n) \equiv 0 \bmod p$. Actually, Kobayashi [1] proves the following dual form of the large sieve (in our notation)

\[ \sum_{M < n \le M+N} \Big| \sum_{r=1}^{R} a_r e(-n x_r) \Big|^2 = (N + O(\delta^{-1})) \sum_{r=1}^{R} |a_r|^2, \quad \forall a_r \in \mathbb{C}, \tag{9.53} \]

using the upper bound form of the large sieve as well as a smoothing 
technique, of Bombieri |U, for a lower bound. (The O-constant in 
< I9-53I > is absolute.) From this he derives Selberg's sieve (cf. Mathews 110 
|3*|). from which on using (I2.90t one obtains, instead of d9.33t . 

W,P,z)£^£. (9.54) 
G(z) 

(cf. Halberstam and Richert [ 1 ] (pp. 125-126)) a result that should be 
compared with MM . It is also possible similarly to get the stronger 
form d7.8t by defining /Ij's in Selberg's sieve in a different way (cf. 
Halberstam and Richert Q] (p. 126)). 

(9.33): For the second inequality in (9.33) we have used that from $d_\nu \mid P(z)$, $\nu = 1, 2$, one has $d := [d_1, d_2] \mid P(z)$ and also for each such $d$

\[ |\{d_1, d_2 : d_\nu < z,\ d_\nu \mid P(z),\ [d_1, d_2] = d\}| \le |\{d_1, d_2 : [d_1, d_2] = d\}| = 3^{\nu(d)}. \tag{9.55} \]
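The count $3^{\nu(d)}$ in (9.55) reflects that each prime factor of a squarefree $d$ may divide $d_1$ only, $d_2$ only, or both. A brute-force sketch of this count (helper names are illustrative):

```python
from math import gcd


def lcm_pair_count(d):
    """Number of ordered pairs (d1, d2) with least common multiple d."""
    divisors = [t for t in range(1, d + 1) if d % t == 0]
    return sum(1 for a in divisors for b in divisors
               if a * b // gcd(a, b) == d)


def distinct_prime_count(d):
    """nu(d): the number of distinct prime factors of d."""
    count = 0
    p = 2
    while p * p <= d:
        if d % p == 0:
            count += 1
            while d % p == 0:
                d //= p
        p += 1
    return count + (1 if d > 1 else 0)


if __name__ == "__main__":
    for d in (1, 2, 6, 30, 210):
        print(d, lcm_pair_count(d), 3 ** distinct_prime_count(d))
```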

Theorem 9.2: cf. Halberstam and Richert [1] (Theorem 3.3).
Theorem 9.3: cf. Halberstam and Richert [1] (Theorem 3.10).



Chapter 10 



Some Applications of the Small Sieve in the Case $\omega(p) = \frac{p}{p-1}$



AS ALREADY mentioned at the beginning of Chapter 9, we present here some applications of the results obtained in that chapter. Theorem 9.3, which is not contained in the arithmetical form of the large sieve (cf. the remark made subsequent to (9.17)), is of particular interest.

We have the following two interesting (cf. Notes) applications, which we shall quote without any details of proof.

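Theorem 10.1. We have, as $N \to \infty$,

\[ |\{p : p \le N,\ p + h = p'\}| \le 4\mathfrak{S}(h) \frac{N}{\log^2 N} \Big\{1 + O\Big(\frac{\log\log N}{\log N}\Big)\Big\}, \quad \forall h \in \mathbb{Z},\ h \ne 0,\ h \equiv 0 \bmod 2, \tag{10.1} \]

uniformly in $h$, and also

\[ |\{p : p \le N,\ N - p = p'\}| \le 4\mathfrak{S}(N) \frac{N}{\log^2 N} \Big\{1 + O\Big(\frac{\log\log N}{\log N}\Big)\Big\}, \quad \text{for } N \equiv 0 \bmod 2, \tag{10.2} \]

where $\mathfrak{S}$ is defined through (9.42).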
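To get a feeling for the size of the main term in (10.1), one can compare it with an actual count of prime pairs. The sketch below evaluates a truncated version of the product (9.42) and the main term $4\mathfrak{S}(h)N/\log^2 N$ without the error factor, so it is an illustration rather than a verification of the theorem; the truncation bound and function names are assumptions.

```python
from math import isqrt, log


def primes_up_to(n):
    """Return all primes p <= n by the sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = bytearray([1]) * (n + 1)
    is_prime[0] = is_prime[1] = 0
    for i in range(2, isqrt(n) + 1):
        if is_prime[i]:
            is_prime[i * i::i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if is_prime[i]]


def singular_series(K, prime_bound=10**5):
    """The product (9.42) for even K != 0, truncated at prime_bound.

    Prime factors of K above prime_bound are ignored by this sketch.
    """
    value = 2.0
    for p in primes_up_to(prime_bound):
        if p == 2:
            continue
        value *= 1.0 - 1.0 / (p - 1) ** 2
        if K % p == 0:
            value *= (p - 1) / (p - 2)
    return value


def prime_pair_count(N, h=2):
    """Number of primes p <= N such that p + h is also prime (h > 0)."""
    prime_list = primes_up_to(N + h)
    prime_set = set(prime_list)
    return sum(1 for p in prime_list if p <= N and p + h in prime_set)


def main_term(N, h=2):
    """Main term of (10.1), without the factor 1 + O(log log N / log N)."""
    return 4 * singular_series(h) * N / log(N) ** 2


if __name__ == "__main__":
    N = 10**5
    print(prime_pair_count(N), main_term(N))
```

The count stays well below the main term, consistent with the remark in the Notes that the upper bound is 4 times the conjectured asymptotic value.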

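More generally one also has

Theorem 10.2. Let $A > 0$ and let $a, b, k, \ell$ be integers satisfying

\[ ab \ne 0, \quad (a,b) = 1, \quad ab \equiv 0 \bmod 2, \tag{10.3} \]

and

\[ (k, \ell) = 1, \quad 1 \le k \le \log^{A} x. \tag{10.4} \]

Then we have, uniformly in $a$, $b$, $k$ and $\ell$, as $x \to \infty$

\[ |\{p : p \le x,\ p \equiv \ell \bmod k,\ ap + b = p'\}| \le 4\mathfrak{S}(abk) \frac{x}{\varphi(k) \log^2 x} \Big\{1 + O_A\Big(\frac{\log\log x}{\log x}\Big)\Big\}. \tag{10.5} \]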

A much more delicate application is the next theorem which shall be 
112 used in Chapter fOJ Though some of the majorizations in the proof of 
Theorem 1 10. 31 are crude, many others involve rather delicate considera- 
tions. Before coming to the formulation of this theorem we shall obtain 
some useful auxiliary results. 

Lemma 10.1. We have 

,2 



V ^M/jV(<?) < h (i ogx+ if f or x > i, h e N. (10.6) 

^ q 

q<x 1 

Proof. Consider 

£ (M h m < Y\(i + -) < f[(i + -) h . (io.7) 



p<x r p<x 



Now d!0.6t is apparent in view of the well-known formula due to 
Mertens, 

fit 1 - -) = f—( l + °(r—»- ^ 10 - 8 ) 

1 J- p logx iOSX 

p<x ° 

on noting that (1 + — ) < (1 - □ 
P q 
Lemma 10.2. Let $A > 0$ and let $h, k \in \mathbb{N}$. Let $k \le \log^{A} x$ for sufficiently large $x$. Set

\[ E(x, d) := \max_{(\ell, d) = 1} \Big| \pi(x; d, \ell) - \frac{\operatorname{li} x}{\varphi(d)} \Big|. \tag{10.9} \]

Then for any $U_1\ (> 0)$ there exists a value $C_1 = C_1(U_1, h, A)$ such that

V f{d)h v ^E{x,dk)^ UuhA X . (10.10) 

u< J* — 

Proof. By the rough estimate (cf. dl0.9t ) 

E(x,di)^^- for di<x (10.11) 

and an application of the Cauchy's inequality followed by extensions of 
ranges for variables in the resulting summations, we see that the expres- 
sion on the left-hand side in dlO.lOt is 

x i ^ u 2 (d)h 2v(d) i i 
«(t) 5 (2^ L ^ )5( Zj E ( x > d »~ 2 - ( 10 - 12 ) 



d£x d<-$- 



In view of the bound given by (110.61) for the first sum here, we make 
the choice of C\, for given U\, by means of ( 16.231 ) such that the second 
sum is bounded by x(log x)~ (2U[+h2+A \ Then, continuing the estimation 
in (I10.12t . we obtain further 

« (j)klogxy2 h \x(logxf u > +h2+A ) l 2 « xQogx) Ul -* A (tp(k)r*- 

Ui,h,A k U[,hA 

(10.13) 

which proves dlO.lOt because <p(k) < k < (log x) A . □ 
Now we are in a position to prove the main theorem of this chapter. 
Theorem 10.3. Let $v$ be a real number satisfying

\[ v > 3. \tag{10.14} \]

Let $h\ (\ne 0)$ satisfy, being determined with respect to sufficiently large $x$,

\[ h \in \mathbb{Z}, \quad h \equiv 0 \bmod 2, \quad \text{and either } h = [x] \text{ or } 0 < |h| \le x^{1/3}. \tag{10.15} \]

Then, as $x \to \infty$, holds

\[ C_v(x, h) := |\{ |h - p_1 p_2 p_3| : |h - p_1 p_2 p_3| = p,\ x^{1/v} \le p_1 < x^{1/3} \le p_2 < p_3,\ p_1 p_2 p_3 \le x \}| \le 4 c(v) \mathfrak{S}(h) \frac{x}{\log^2 x} \Big\{1 + O_v\Big(\frac{\log\log x}{\log x}\Big)\Big\}, \tag{10.16} \]

where $\mathfrak{S}$ is defined by (9.42) and

\[ c(v) := \int_{1/v}^{1/3} \frac{\log(2 - 3\alpha)}{\alpha(1 - \alpha)} \, d\alpha. \tag{10.17} \]

(Note that the $O$-constant in (10.16) depends at most on $v$.)
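The constant $c(v)$ of (10.17) has no obvious closed form but is easily evaluated numerically. The following sketch uses the composite Simpson rule; the step count and the function names are choices made for illustration only.

```python
from math import log


def integrand(alpha):
    """Integrand of (10.17): log(2 - 3*alpha) / (alpha * (1 - alpha))."""
    return log(2.0 - 3.0 * alpha) / (alpha * (1.0 - alpha))


def chen_constant(v, steps=2000):
    """Approximate c(v) on [1/v, 1/3] by the composite Simpson rule."""
    if steps % 2:
        steps += 1                      # Simpson needs an even step count
    lower, upper = 1.0 / v, 1.0 / 3.0
    width = (upper - lower) / steps
    total = integrand(lower) + integrand(upper)
    for i in range(1, steps):
        weight = 4 if i % 2 else 2
        total += weight * integrand(lower + i * width)
    return total * width / 3.0


if __name__ == "__main__":
    for v in (4, 6, 10):
        print(v, round(chen_constant(v), 4))
```

Since the integrand is positive on the whole range, $c(v)$ increases with $v$; for $v = 10$ the value is a little below 1/2.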
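Proof. Let us consider (the finite sequence)

\[ \mathcal{B} := \Big\{ b : b = pd,\ d \in \mathcal{D},\ p \le \frac{x}{d} \Big\}, \tag{10.18} \]

where

\[ \mathcal{D} := \Big\{ d : d = p_1 p_2,\ x^{1/v} \le p_1 < x^{1/3} \le p_2 < \Big(\frac{x}{p_1}\Big)^{1/2} \Big\}. \tag{10.19} \]

We note that each $d \in \mathcal{D}$ (has a unique representation as $p_1 p_2$ and) satisfies

\[ x^{\frac{1}{3} + \frac{1}{v}} \le d = p_1 p_2 < \sqrt{p_1 x} < x^{2/3}, \tag{10.20} \]

and so

\[ |\mathcal{D}| < x^{2/3}. \tag{10.21} \]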

With a view to determine a suitable approximation to \&\, we use 

the formula (taking p = - and p — —) 
v 3 

V I = log(-^) + 0(— —)fory>xP, (10.22) 
J^ <y P Plogx plogx 
for Stieltjes integration to obtain (with p — — ) 



x/pi 

+ o( 1 ). (10.23) 



, p 2 \og x -£ J 77 log 7/ log ^ log 2 x 

Further multiplying (110.231) by — and summing over x 1 ^' < p\ < 

Pi 

x 1 / 3 we get, by use of dl0.22t (with p = 1/v), 

,1/3 (f) 1/2 

1/3 (l-a)/2 

log* J a J ftl-a-fl ^ lJyK \og 2 x ) 
1/v 1/3 
l 

= l f l2S£z3£)j a + o ( 1 ) = £0± + O v (-i-), 

log* J a(l-or) lv log 2 A: / log* vv log 2 A: / ' 

V 

(10.24) 

where we have put 

§ = jf,Ti = J. (10.25) 

and have also used the notation (I10.17I) . Thus since every d € ^ is 

< jc 2 ^ 3 (cf. ( 110.201 ). by dlO.lBt and the prime-number theorem (in a 115 

weak from) one has 



x " -(^-))^d + o v( J-) 

<fe^ ev^ log^ log* log* 



(10.26) 

on using (I10.24t . (Note that the formula (110.26b counts the numbers in 
g$ according to the multiplicity of their occurrence.) □ 

Towards the estimate (10.16), we naturally consider

\[ S(\mathcal{A}, \mathfrak{P}_h, z), \tag{10.27} \]

where

\[ \mathcal{A} := \{ |h - p_1 p_2 p_3| : x^{1/v} \le p_1 < x^{1/3} \le p_2 < p_3,\ p_1 p_2 p_3 \le x \}, \tag{10.28} \]

\[ \mathfrak{P}_h := \{p : p \nmid h\}, \tag{10.29} \]

and (we make the choice) 

z 2 = xU~™ (10.30) 

with the abbreviation 

^:L = logx. (10.31) 
Instead of si it is more convenient (cf. ( 110.601 ) to work with 

sT := {\h - pip 2 Ps\ ■ * Uv <Pi< x 1 ' 3 <p 2 < J—,PiP2P3 < x). 

V Pi 

(10.32) 

Now the primes = 1,2,3) occurring in si satisfy P1P2 < x 

and so p2 < A—, which shows that si is contained in si* (even with 
V P\ 

regard to multiplicity of numbers in it). Therefore 

S(s/,Vh,z)<S(si*,Vh,z). (10.33) 

Note also that all the elements of si* have the same type of repre- 
116 sentation from among 

h - P\P2P3,PiP2P3 ~ h or \h\ + pip 2 p3 (10.34) 

according as h = [x], \h\ < x 1 ' 3 with h > or h < respectively (cf. 
( 110.151 )) and hence, in particular, that the multiplicity of a number in s/* 
is exactly the multiplicity of the corresponding P1P2P3 in 

Next, we prepare for an application of Theorem l9. 3l with respect si* , 
P/, and z. Comparing the definitions of si* and 2$ (through we see 
that, for (q, h) = 1 with fi(q) + 0, by (11.531) (in the notation of Chapter 
|5J) one has 

4*1 = Z 1 = as Z Z = 

feftmod^ (10 35) 

= Z*o(&) + ^ Z 

X*Xo 
in view of the remark involving (10.34). From here we get

X*XO (b.q)>\ 

(10.36) 

say. In particular 

\sT\ = = \m, (10.37) 
which is clear otherwise also. Therefore we make the choices 

w(p) = -^forp e p h (10.38) 



and, by (fT026t . 



X = |^| = c(v)-^-(l + O v (^-)). (10.39) 
log x log X 



Now, S(stf, Vh,z) counts all numbers of (the set in) ( 110.161 which 
exceed z and so (because of (110.331) ) applying Theorem 19 . 3 1 with K - h 
for £/* it follows, by d93TV STU1% and SWM . 



x log log X 

v)ix>yri)— 1 " ' 

lc 

where (cf. (110.361) ) 



C v (x,h) < 4c(v),@{h)—^—{\ + O v { °f ° gX )} + V (10.40) 
log 2 x log ^ 



£:=z + ^ /z 2 ( ? )3^ ) |R,|. (10.41) 

<7<z 2 
(<?,*)=! 



The rest of the proof concerns with an estimation of 2o which shows 
this contribution to d!0.40t as being of the nature of an error-term. The 
sum in (110.4 U is, by Cauchy's inequality and (I10.36I) . 



< ( Aq)9 v(q) \R q \)k 2 » 2 (v) R */ 2 - ( 10 - 42 ) 

q<z 2 q<z 2 
(q,h)=l (q,h)=l 
Now trivially, from (10.35), (10.26), we have (cf. (10.15), cf. also

dTnmi» 



3C I 3C 



\R q \ «v ( + — - -: ) «v - (10.43) 

so that the first sum in dl0.42l > is, by Lemma flO.il 

= O v (x\og 9 x). (10.44) 

(Observe that the first term of the middle expression in (110.43ft is ob- 
tained by using the fact that the multiplicity of any member of Si* is 
absolutely bounded (cf. (I10.34I I).) Next we deal with the simpler part, 
of the second sum in (110.42ft . arising from the second term defining R*. 
We have for q < x with q squarefree, 



E i<Zi+ 2 2 i<v(q)(\®\ + x 1 -- 2 i) 

•eSS de3> . 
,q)>\ p\q 

«, jc 1_1/, 'logx, 



(i,g)>l p|g ^"Spi^^J F Pin x'l'<p 2 <x^ 

pilg 



(10.45) 

on using (I10.21I) . (110.22ft . (110.15ft and the trivial estimate 

vfa) < 5-^! Vfl € N. (10.46) 
log 2 * 



Hence (110.45ft leads to the estimate, for the part of the sum in ( 110.42ft 
which in under consideration via (18.321) 



log x V ^ « v X 1 "; log 2 x. (10.47) 



Now collecting together the bounds ( 110.42ft . ( 110.44ft and dlO.471 ) we 
see that because of the choice dl0.30ft . 

2 

2_j Z 2 + xlog 9 x(x l ~~ log 2 x + 2J) x 2 ~~ log 11 x + xlog 9 X. 
1 I 

(10.48) 
where (cf. (TiTHoT i) 



Z-Z*$ZiZ*»* 

X*XO 



(10.49) 



Observe that /j has no longer a part to play in the sequel. 
Transition to primitive characters in dl0.49t yields (cf. ( U.59I 0. on 
writing q-rf(J: conductor x m °d q) 



Z,l Z, ^ Z, ^f) lu 



E I E *(*)! = Z ^Z 2 M, 



(>•,/)= i 
p 2 (/) 



Z *(*)!< 

A mod / be3S 
X*Xo (b,r)=\ 

li\r) 



(10.50) 



r<z L 



<p(r) 



<p(f) 



f<z 2 

where (on replacing / by q) 



Xmod f beSS 
X*X0 Q>,r)=\ 



r<z L 



<p(r) 



z>=-z£f ZiZ*»» 

2 q<z 2 ^ A" mod 4 fes^ 

**#0 (M=i 



(10.51) 



(In the remaining part we also use the abbreviation d!0.31t wherever 
convenient.) 

By the Siegel-Walfisz theorem (cf. ( I8.2t ) we have for any character 
X + Xo m °d <7> m v i ew of ( 11.44ft and d!0.46t . 



p<y 



l °Sr 



• : 



log 2 



uniformly for q <sc log g j as y — > oo, 
so that (with g = 17) 



(10.52) 



y Qg) y 



7 ¥>(«) 



^ mod £/ 



Z *(*) 



« Z Z ^(<?K 51 «v ^ 16 

q<(" deS> 



_ y Y 

,17 ^ j 



Z *(<0 Z *0>)l< 

de@ p<x/d, 
X*Xo' V,r)=l 



(10.53) 
Hence

£W «v Z (r) + Xrl6 ' (10 - 54) 



where 

" f 2 («> V v (10 . 55) 



Z«> - Z Z Z *» 

(i>,r)=l 



For an estimation of (110.551) we use contour integration and the hy- 
brid sieve. To this and we put 

T = x\ (10.56) 

as well as 

a=\ + C x . (10.57) 

Further let 

l xl < w < z 2 (10.58) 
and also note that the supposition 

x = [x] + i (10.59) 
involves no loss of generality. We also introduce the Dirichlet series 



P = P 



p<w p>w 

p-fr p-fr 

D - D r (^) := J] > ( 10 - 6 °) 

(<M=1 

Now, by (110.191 and a well-known formula (using ( I10.59t ) 

a+iT 

X X(d) = f(P + Q)D{s)-ds + <9(^^). (10.61) 
Splitting the integral here into two parts, corresponding to (PD)(s)
and (QD)(s) respectively, and shifting the line of integration of the for- 
mer only to cr = — we obtain, on using dl0.57t . 

T 1/2 

2 \(PD)(\ + it, X )\A\ + Y f \(PD)((r±iT, X )\da+ 

beSg _r " a 

(b,r)=l 
T 

+x j \(QD)(a + it, X )\^ + ^. 

(10.62) 

For the second term on the right-hand side the crude estimate 
\{PD)(cr ± it, X )\ < J] J] 1 < w 2 \&\ < /for- < cr < a, (10.63) 

p<w 2 dt@ 

obtained from (TRUTH. (I10.59L (THHlT l and (TITH^ . suffices. 

Towards an estimation of the remaining terms we introduce a nota- 
tion (for convenience of description). For an arbitrary function f(s,x), 
s = cr + it we set (with respect to r, w, T and z as before) 

* r 

"<*/> : = I ±) ^ J^^'iTm' (ia64) 

(?,r)= 1 1 

and observe that, by Cauchy-Schwarz inequality, for any two o(such) 
functions /] and fi one has 

M(cr, f x fi) < (M(cr, / 1 2 ))i(M( C r, / 2 2 ))4 . (10.65) 
We would also need the estimate (valid with an absolute «c-constant) 

M(<r,f) « |](w 2 log* + / 3 )|ajV^,for/ = f(s, X ) = £ J*(n) if J < °°> 

h=1 n=l n=l 

(10.66) 

which is obtained by partial integration in Theorem 15.11 and using 121 

mm, . 
By (10.62), (10.64), (ITU351 and (THBoT l there holds

+x(M(a, G 2 )M(a, £> 2 ))3 + w 2 . (10.67) 

Now we apply dl 0.661 ) to the four M's occurring in dl 0.671 ) to obtain 
in succession (with absolute <sc-constants) 

M(i P 2 ) « J] (w 2 log x + /j)^" 1 « w 2 log 2 x, (10.68) 

p<w 2 

because of (fTO60l> . (fT058l) and (fTtBOl ). 

M(^,D 2 )« ^(w 2 logx + J)J _1 «w 2 log 2 x + x3, (10.69) 

deS> 

using in addition dl0.21l) further (cf. dl0.57l) ) 

M(a, Q 2 ) « ( w2 1o § x + P)^' 20 lo § x > (10.70) 

p>w 2 

and lastly 

M(a, D 2 ) « ^(w 2 log x + d)d~ 2a « w 2 (log 2 x)x"5 + log x. (10.71) 

Using the four estimates dl0.681 >— (TlO.7 ll > in dl0.67t we arrive (cf. 
([10581 . (ITCTl ) 

Z jfc i I I ^)|«(xw 2 log 2 x(w 2 log 2 x + x5))^ 

f 17 <a<tt> ^mod^ &e,^ 
0>,r)=l 

' 2 9 —1 i 9 1 I 

+x(log x(w x 3)) 2 + w <s x2wlogx(wlog +X + X3) 

+x log x(wx"s + 1) + w 2 «; x log x + wx 5 6 log x + w 2 x5 log 2 X, 

(10.72) 
valid uniformly in r. This gives, by partial summation and (10.30),

1 * 

X ~n 2 1 Z <<: xrl6 + xi ^ 2 <<: xrl6 (10 - 73) 

(b,r)=l 

uniformly in r. 

Note that the left-hand side expression in dlO.731 ) majorizes ^(r) of 
(110.551) . Therefore, it follows from dl().54i that 

Yj,r) « v xC 16 , (10.74) 

2 

which when used in d!0.50t yields (by means of ( 18.32ft ) the estimate 

^ « v xr 15 . (10.75) 
I 

Thus we obtain, on using (10.75) in (10.48),

£ « v x(log^)~ 3 , (10.76) 
o 

from which, in view of (10.40), follows the estimate in (10.16). This
completes the proof of Theorem 10.3.

NOTES 

Theorem 10.1: The results of this theorem were proved first under the
assumption of the generalized Riemann hypothesis by Wang [2], and the
unconditional proof of these results (with an application to have (16.77b 

with a 9 < -) is due to Bombieri and Davenport [ 1 ]. The main terms of 



(10.1) and (10.2) are 4 times the conjectured asymptotic formulae
(without error terms), for the respective problems, of Hardy and Littlewood.
In this context it is significant to mention that Montgomery has pointed
out (in correspondence) that a decrease of the factor 4 here to any constant
c < 4 would have the same consequences regarding the Siegel zeros as
has been remarked in connection with the Brun-Titchmarsh theorem (cf . 
Notes of Chapter^] under dOTT i). 
Theorem 10.2: See Klimov, and Halberstam and Richert [1]
(Theorem 3.12).

(10.8): The formula (10.8), though not needed in this full force for
Lemma 10.1, is of much use in later chapters.

Lemma 10.2: cf. Halberstam and Richert [1] (Lemma 3.5).

Theorem 10.3: This result is the most essential part of Chen's proof
of his famous theorem with respect to Goldbach's conjecture. For v = 10
it occurs in Chen [1], Halberstam and Richert [1] (Chapter 11, with
some simplifications due to P.M. Ross) (with the weights $\Lambda(n)$), and in
Ross [1]. We shall use Theorem 10.3 with v = 8 in Chapter 13, thereby
obtaining the advantage of dealing with elementary functions
in connection with the lower bound estimation via Selberg's sieve (cf.
Notes of Chapter 13 preceding (13.27)).

(10.17): For later use we obtain an estimate for c(v). First, note
that the function $f(x) = x^2 - 2x\log x$ is increasing for $x \ge 1$ (since the
derivative $f'(x) = 2(x - 1 - \log x) > 0$ for $x > 1$). So $f(x) > f(1)$ for
$x > 1$, which means that $x^2 - 2x\log x > 1$; i.e.,

$$ \log x < \frac{(x-1)(x+1)}{2x} \quad \text{for } x > 1. \tag{10.77} $$
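
The inequality (10.77) is easy to spot-check numerically. The following Python sketch evaluates both sides on a grid of points x > 1; the helper names `rhs_10_77` and `gap` are illustrative and do not come from the text, and a floating-point check on sample points is of course only a sanity test, not a proof.

```python
import math


def rhs_10_77(x: float) -> float:
    """Right-hand side of (10.77): (x - 1)(x + 1) / (2x)."""
    return (x - 1.0) * (x + 1.0) / (2.0 * x)


def gap(x: float) -> float:
    """Right-hand side minus log x; (10.77) asserts this is positive for x > 1."""
    return rhs_10_77(x) - math.log(x)


# Sample points 1.01, 1.02, ..., 10.99 (all strictly greater than 1).
samples = [1.0 + k / 100.0 for k in range(1, 1000)]

# Every sampled gap should be strictly positive.
assert all(gap(x) > 0.0 for x in samples)

# The gap is smallest near x = 1, where both sides vanish.
print(min(samples, key=gap))
```

The gap shrinks rapidly as x approaches 1, which is why the grid starts at 1.01 rather than closer to the endpoint.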

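(However, observe also that (10.77) follows from

$$ \int_a^b F(y)\,dy \le \frac{b-a}{2}\,(F(b) + F(a)) \quad \text{for any convex function } F,\ b > a, \tag{10.77$'$} $$

on taking $F(y) = y^{-1}$ and $a = 1$, $b = x\,(> 1)$.) Hence (10.77), with
$x = 2 - 3\alpha$, gives

$$ \frac{\log(2-3\alpha)}{\alpha(1-\alpha)} < \frac{3}{2}\cdot\frac{1-3\alpha}{\alpha(2-3\alpha)} = \frac{3}{4}\Big(\frac{1}{\alpha} - \frac{3}{2-3\alpha}\Big) \quad \text{for } 0 < \alpha < \frac{1}{3}, \tag{10.78} $$

from which we obtain

$$ c(v) = \int_{1/v}^{1/3} \frac{\log(2-3\alpha)}{\alpha(1-\alpha)}\,d\alpha \le \frac{3}{4}\int_{1/v}^{1/3}\Big(\frac{1}{\alpha} - \frac{3}{2-3\alpha}\Big)d\alpha = \frac{3}{4}\log\Big(\frac{v^2}{3(2v-3)}\Big). \tag{10.79} $$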
(10.61): Titchmarsh, E.C., The Theory of the Riemann Zeta-Function
(Oxford), Lemma 3.12.

The effective way of treating the remainder terms by an explicit use
of analytical methods, as done here subsequent to (10.61), is of recent
origin. It occurs in the papers of Barban and Vehov [1] (see Motohashi
DDI) Hooley Q. Huxley 0, Chen Q. Motohashi |8). Halber- 
stam and Richert [ 1 ]. Goldfeld [4], and Ross [ 1 ]. Excepting the first of 
these papers, all the others employ this method for the purpose of some
applications only. In that paper Barban and Vehov [1] sketch a proof (a
rigorous proof was given by Motohashi [10]) of the following
surprisingly uniform result:

If x > z, log z » log z\ , z\ >z and 




(10.80) 



then we have 




(10.81) 



Chapter 11 



A Generalized form of 
Selberg's Sieve 

IN CHAPTER 9 we discussed a simple version of Selberg's sieve along
with two particular cases, corresponding to the choices

$$ \omega(p) = 1 \quad\text{and}\quad \omega(p) = \frac{p}{p-1} \quad (\text{for } p \in \mathfrak{p}), \tag{11.1} $$

and applied the latter, in the next chapter, to obtain the important
Theorem 10.3 for the purposes of Chapter 13. In this chapter we continue the
theme of the small sieve of Selberg with a view to enunciating it in its
best form (in a certain sense), which is useful also in the proofs of the
results of Chapter 13. However, our account of this aspect of Selberg's
sieve here will be sketchy (with relevant references being included in the
Notes).

At the outset we recall that Selberg's Theorem 9.1 was proved subject
to only the condition $(\Omega_1)$ (cf. (9.16)), which restriction can also
be stated in the form

$$ 0 \le \omega(p) \le \delta p \quad \text{for some } \delta < 1. \tag{11.2} $$

(Though (11.2) does not make Selberg's sieve 'too large a sieve',
nevertheless it leaves the sieve a 'large' one still.) However, to effectively
deal with the function G(z) and the remainder terms of Theorem 9.1 one



needs information concerning the average order of magnitude of $\omega$ (and
also of the $R_d$'s). In this connection the condition

$$ (\Omega_2(\kappa, L)) \qquad -L \le \sum_{w \le p < z} \frac{\omega(p)\log p}{p} - \kappa\log\frac{z}{w} \le A_2 \quad \text{for } 2 \le w \le z, \tag{11.3} $$

which tells us that $\omega$ is 'on the average' equal to $\kappa$, is useful (when the
$R_d$'s are 'small', at least on the average) for obtaining both the upper and
lower estimates for $S(\mathcal{A}, \mathfrak{p}, z)$. Then the O-constants in these estimates are
allowed to depend on $A_1$ (from $(\Omega_1)$) and $\kappa$ (and hence, comparing with
(11.2), the sieve is 'small'), on $A_2$ (from $(\Omega_2(\kappa, L))$), and on some constants
implicit in the restrictions on the $R_d$'s (cf. (11.62)), but not on any other
parameter involved (in particular, they are independent of L, which is of the
nature of an error term, in practice thereby requiring a separate consideration).

The constant $\kappa$ in (11.3) is called the 'dimension' of the sieve
problem.
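
To make the condition (11.3) concrete, the following Python sketch evaluates its middle expression in the simplest case $\omega(p) = 1$, where Mertens' estimate suggests dimension $\kappa = 1$. The names `primes_below` and `omega2_defect`, the sample pairs (w, z) and the numerical bound 1.5 are illustrative assumptions, not taken from the text; the computation only indicates that the expression stays bounded on the chosen samples.

```python
import math


def primes_below(n: int) -> list:
    """Return all primes p < n using a simple sieve of Eratosthenes."""
    if n <= 2:
        return []
    is_prime = bytearray([1]) * n
    is_prime[0] = is_prime[1] = 0
    for i in range(2, int(n ** 0.5) + 1):
        if is_prime[i]:
            # Cross out the multiples i*i, i*i + i, ... below n.
            is_prime[i * i::i] = bytearray(len(range(i * i, n, i)))
    return [i for i in range(n) if is_prime[i]]


PRIMES = primes_below(10 ** 5)


def omega2_defect(w: int, z: int) -> float:
    """Middle expression of (11.3) for omega(p) = 1 and kappa = 1:
    sum of log(p)/p over w <= p < z, minus log(z/w). Requires z <= 10**5."""
    total = sum(math.log(p) / p for p in PRIMES if w <= p < z)
    return total - math.log(z / w)


pairs = [(2, 10), (2, 1000), (2, 10 ** 5), (50, 5000), (1000, 10 ** 5)]
defects = [omega2_defect(w, z) for w, z in pairs]

# On these samples the defect stays well inside a fixed interval,
# as (11.3) requires with suitable constants L and A_2.
assert all(abs(d) < 1.5 for d in defects)
print(defects)
```

For a sieve problem of dimension $\kappa$ the same experiment would subtract $\kappa \log(z/w)$ instead.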

As we shall see later, the question of obtaining lower bounds for
our sifting function $S(\mathcal{A}, \mathfrak{p}, z)$ can be linked up, in a significant way,
with the problem of finding good upper estimates for it. Accordingly,
we now deal with the latter problem. First, we mention that when one
combines, for this purpose, Theorem 9.1 with $(\Omega_2(\kappa, L))$, or instead even
with the one-sided restriction

$$ (\Omega_2(\kappa)) \qquad \sum_{w \le p < z} \frac{\omega(p)\log p}{p} \le \kappa\log\frac{z}{w} + A_2 \quad \text{if } 2 \le w \le z, \tag{11.4} $$

the results obtained are quite satisfactory. In fact, an elementary
reasoning gives, under the condition $(\Omega_2(\kappa))$ (in addition to $(\Omega_1)$), the estimate

$$ \frac{1}{G(z)} \ll W(z) \tag{11.5} $$

for the function G(z) of Theorem 9.1 in terms of W(z) defined by (9.20),
and with a little more effort, on using $(\Omega_2(\kappa, L))$ in place of $(\Omega_2(\kappa))$ here,
one obtains

$$ \frac{1}{G(z)} = W(z)\,e^{\gamma\kappa}\,\Gamma(\kappa+1)\Big\{1 + O\Big(\frac{L}{\log z}\Big)\Big\}. \tag{11.6} $$

Thus we get, by means of (the last inequality of) Theorem 9.1, the
following two theorems (respectively):
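Theorem 11.1. $(\Omega_1)$, $(\Omega_2(\kappa))$:

$$ S(\mathcal{A}, \mathfrak{p}, z) \ll X W(z) + \sum_{\substack{d < z^2 \\ (d, \bar{\mathfrak{p}}) = 1}} \mu^2(d)\,3^{\nu(d)}\,|R_d| \tag{11.7} $$

and

Theorem 11.2. $(\Omega_1)$, $(\Omega_2(\kappa, L))$:

$$ S(\mathcal{A}, \mathfrak{p}, z) \le X W(z)\,e^{\gamma\kappa}\,\Gamma(\kappa+1)\Big\{1 + O\Big(\frac{L}{\log z}\Big)\Big\} + \sum_{\substack{d < z^2 \\ (d, \bar{\mathfrak{p}}) = 1}} \mu^2(d)\,3^{\nu(d)}\,|R_d|. \tag{11.8} $$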

Here, again, we point out that the O-constants in these two theorems
depend at most on $A_1$, $A_2$ and $\kappa$ inherent in $(\Omega_1)$ and $(\Omega_2(\kappa, L))$ (or $(\Omega_2(\kappa))$).

Next, in accordance with the remark involving (11.3), we consider
the question of the magnitude of the $R_d$'s. There are many cases in which
one has the following information:

$$ (R) \qquad |R_d| \le \omega(d) \quad \text{if } \mu(d) \ne 0 \text{ and } (d, \bar{\mathfrak{p}}) = 1. \tag{11.9} $$

In such a situation one readily obtains, from (the second inequality
of) Theorem 9.1 and (11.5),



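Theorem 11.3. $(\Omega_1)$, $(\Omega_2(\kappa))$, $(R)$: For any A > 0,

$$ S(\mathcal{A}, \mathfrak{p}, z) \ll X \prod_{p < z}\Big(1 - \frac{\omega(p)}{p}\Big) \quad \text{if } z \le X^A, \tag{11.10} $$

where the $\ll$-constant depends at most on A, $A_1$, $A_2$ and $\kappa$.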



In the literature, usually, the phrase 'by Brun's sieve...' refers to
the statement (11.10). Here notice that Theorem 11.3 is available under the
more convenient condition, instead of $(\Omega_2(\kappa))$,

$$ (\Omega_0) \qquad \omega(p) \le A_0 \tag{11.11} $$

(since $(\Omega_0)$ implies $(\Omega_2(\kappa))$). Similar to Theorem 11.3, on using (11.6)
with Theorem 9.1 (cf. (10.8)), one has the explicit
Theorem 11.4. $(\Omega_1)$, $(\Omega_2(\kappa, L))$, $(R)$:

$$ S(\mathcal{A}, \mathfrak{p}, z) \le \Gamma(\kappa+1)\prod_p\Big\{\Big(1 - \frac{\omega(p)}{p}\Big)\Big(1 - \frac{1}{p}\Big)^{-\kappa}\Big\}\frac{X}{\log^{\kappa} z}\Big\{1 + O\Big(\frac{\log\log 3z + L}{\log z}\Big)\Big\} \quad \text{if } z \le X^{1/2}, \tag{11.12} $$

where the infinite product converges and the O-constant depends at most
on $A_1$, $A_2$ and $\kappa$.

On the other hand, in the absence of the information (R), as happens
with more delicate problems, one has to seek at least an average result
about the $R_d$'s (cf. the remark containing (11.3)). It is at this stage that the
Bombieri-type results (cf. Chapter 6, § 3) are effective. It is easily seen
from the last sums in Theorems 11.1 and 11.2 that the size of $z^2$ (which
is about

$$ \frac{X^{1/2}}{\log^{C} X} \tag{11.13} $$

usually) is very important, since a smaller choice of z increases the
leading term in view of the factor W(z). For making this remark a little more
explicit, notice that under (R) one could take $z^2 \le X$ (cf. (11.12)), whereas
the use of Bombieri's theorem allows us, for example, to go only up to the bound
(11.13), so that

$$ z^2 \le \frac{X^{1/2}}{\log^{C} X}, \tag{11.14} $$
which worsens the leading term in (11.8) by a factor of $2^{\kappa}$ (in view of the
fact that W(z) behaves like $c(\kappa)\log^{-\kappa} z$ under $(\Omega_2(\kappa, L))$). However, if we ask
for a bound for the primes represented by an (irreducible) integer-valued
polynomial F, then we can sieve the sequence {F(p)}. Then (leaving
minor details apart) the dimension of the problem would be 1 and one
has to use Bombieri's theorem instead of (R), thereby (apparently) losing
a factor 2. Instead one can also sieve (as was necessary before the
availability of Bombieri's theorem) the sequence {nF(n)}. In that event, the
dimension becomes 2 and consequently one loses a factor 4 instead of 2.
This example provides an instance of how Bombieri's theorem permits
linearizing a problem (i.e., reducing the dimension by one) and thus saving
a factor of 2 in the upper estimate.

Now returning to our problem of obtaining good upper estimates,
with a view to achieving lower bounds (which are far more important),
for $S(\mathcal{A}, \mathfrak{p}, z)$, we find that it is helpful (cf. (11.32)) to generalize the
method of Chapter 9 by the introduction of a new parameter (which is
possible because of the dual role of z there, expressed through $d \mid P(z)$ and
$d < z$) in the following way.

Again we start with (9.26), which holds true under the single condition

$$ \lambda_1 = 1, \tag{11.15} $$

and then require (9.27) with respect to some arbitrary

$$ \xi > 1, \tag{11.16} $$

instead of the z inherent in $S(\mathcal{A}, \mathfrak{p}, z)$, i.e.,

$$ \lambda_d = 0 \quad \text{for } d \ge \xi. \tag{11.17} $$

(Note that (11.17) is consistent with (11.15) because of (11.16).)
Proceeding as in Chapter 9 one is now led to the choices

$$ \lambda_d := \frac{\mu(d)}{\prod_{p \mid d}\big(1 - \omega(p)/p\big)}\cdot\frac{G_d(\xi/d, z)}{G(\xi, z)}, \tag{11.18} $$

where

$$ G_k(x, z) := \sum_{\substack{d < x \\ d \mid P(z) \\ (d,k) = 1}} g(d), \qquad G(x, z) := G_1(x, z) \quad \text{for } 0 < x \in \mathbb{R}, \tag{11.19} $$

are generalizations of the functions in (9.22) (and (9.21)). Also note here
that (11.18) includes both (11.15) and (11.17). Now again, as before,
one has

$$ |\lambda_d| \le 1. \tag{11.20} $$
Further, with the choice (11.18), we get (9.26) with

$$ \Sigma_1 = \frac{1}{G(\xi, z)} \tag{11.21} $$

and also, corresponding to (9.31),

$$ \Sigma_2 \le \sum_{\substack{d_\nu < \xi \\ d_\nu \mid P(z) \\ \nu = 1, 2}} |\lambda_{d_1}\lambda_{d_2} R_{[d_1, d_2]}| \le \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_d| \tag{11.22} $$

because of (11.20), (11.17) and (9.32). Now we see that $\xi$ has, so to
speak, taken over from z the role of controlling the order of magnitude of
the remainder sum (cf. the remark preceding the statement of (11.15)).

Also we would formulate (cf. again (11.32)) Theorem 9.1 generalized
further so as to include all $\mathcal{A}_q$'s for q's restricted by

$$ (Q) \qquad \mu(q) \ne 0, \quad (q, P(z)) = 1, \quad (q, \bar{\mathfrak{p}}) = 1. \tag{11.23} $$

Here we stress that the $\mathcal{A}_q$'s are related to $\mathcal{A}$ through the
approximations required by (9.15) (and (9.9)) and consequently this step is not
merely a change of notation. The condition (Q) ensures, for this step,
that the only changes required in the previous generalization of Theorem
9.1 are the replacements of

$$ X \quad \text{by} \quad \frac{\omega(q)}{q}\,X \tag{11.24} $$

and of

$$ R_d\text{'s} \quad \text{by} \quad R_{qd}\text{'s}. \tag{11.25} $$

Thus from (EH61 (fTT2Tt . d11.24t . tTT13 and d11.25t . in view 
( II 1.161 ). the required generalization of Theorem 19. II follows; namely, 
one has 

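Theorem 11.5. $(\Omega_1)$, $(Q)$: For any real number $\xi > 1$,

$$ S(\mathcal{A}_q, \mathfrak{p}, z) \le \frac{\omega(q)}{q}\cdot\frac{X}{G(\xi, z)} + \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}|. \tag{11.26} $$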
Actually, for q = 1 and $\xi = z$, (11.26) is the second inequality of
Theorem 9.1, since

$$ \mathcal{A}_1 = \mathcal{A} \tag{11.27} $$

and

$$ G(z, z) = G(z). \tag{11.28} $$

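Here, by imposing the condition $(\Omega_2(\kappa))$ alone we cannot expect to
get very useful results. However, as a starting point for the method of
obtaining a lower bound for our sifting function, we use $(\Omega_2(\kappa))$ only
to derive a simple estimate for $1/G(\xi, z)$, so that it follows from Theorem 11.5:

Theorem 11.6. $(\Omega_1)$, $(\Omega_2(\kappa))$, $(Q)$: For

$$ \tau := \frac{\log \xi^2}{\log z} \ge 2, \tag{11.29} $$

there holds

$$ S(\mathcal{A}_q, \mathfrak{p}, z) \le \frac{\omega(q)}{q}\,X W(z)\Big\{1 + O\Big(\exp\Big\{-\frac{\tau}{2}\Big(\log\frac{\tau}{2} + 2\Big)\Big\}\Big)\Big\} + \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}|. \tag{11.30} $$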

Now we are in a position to point out how, by means of a certain very
effective combinatorial argument (in the form of identities concerning
our functions $S(\mathcal{A}_q, \mathfrak{p}, z)$ and W(z)), which is really a result about
arrangements from mathematical logic, one can obtain some lower bounds
and also improved estimates for our sifting functions from the estimates
of Theorem 11.6.

Buchstab was the first to notice the fruitful utility of this
combinatorial result for the purposes of sieve methods. We state this result as
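Lemma 11.1. $(Q)$: If

$$ 2 \le z_1 \le z, \tag{11.31} $$

then we have

$$ S(\mathcal{A}_q, \mathfrak{p}, z) = S(\mathcal{A}_q, \mathfrak{p}, z_1) - \sum_{\substack{z_1 \le p < z \\ p \in \mathfrak{p}}} S(\mathcal{A}_{qp}, \mathfrak{p}, p) \tag{11.32} $$

as well as

$$ W(z) = W(z_1) - \sum_{z_1 \le p < z} \frac{\omega(p)}{p}\,W(p). \tag{11.33} $$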

Proof. Let $p_1, p_2, \ldots$ be all the primes belonging to $\mathfrak{p}$, written in their
natural order, which are greater than or equal to $z_1$. If $z \le p_1$ we have,
by d97T5l and (FTTH 



$$ P(z) = P(z_1) \quad \text{and} \quad \omega(p) = 0 \quad \text{for } z_1 \le p < z. \tag{11.34} $$

This gives, by (9.6) and (9.20),

$$ S(\mathcal{A}_q, \mathfrak{p}, z) = S(\mathcal{A}_q, \mathfrak{p}, z_1) \quad \text{and} \quad W(z) = W(z_1), \quad \text{if } z \le p_1. \tag{11.35} $$

Thus Lemma 11.1 is trivially true in this case ($z_1 \le z \le p_1$).
Now suppose that $p_1 < z$, so that defining the integer N by $p_N < z \le p_{N+1}$
we have $N \ge 1$. Then, for each integer $\nu = 1, \ldots, N$, (9.6) yields

$$ S(\mathcal{A}_q, \mathfrak{p}, p_{\nu+1}) - S(\mathcal{A}_q, \mathfrak{p}, p_\nu) = -\big|\{a : a \in \mathcal{A}_q,\ a \equiv 0 \bmod p_\nu,\ (a, P(p_\nu)) = 1\}\big| = -S(\mathcal{A}_{qp_\nu}, \mathfrak{p}, p_\nu) \tag{11.36} $$

and, by (9.20), we have

$$ W(p_{\nu+1}) - W(p_\nu) = -\frac{\omega(p_\nu)}{p_\nu}\,W(p_\nu). \tag{11.37} $$

Summing up the identities (11.36) and (11.37) over $\nu = 1, \ldots, N$
and observing that $S(\mathcal{A}_q, \mathfrak{p}, z) = S(\mathcal{A}_q, \mathfrak{p}, p_{N+1})$ and $W(z) = W(p_{N+1})$ we
obtain (11.32) and (11.33). This completes the proof of the lemma.
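
The two identities of Lemma 11.1 can be checked by brute force in the simplest setting. The Python sketch below assumes, purely for illustration, that $\mathcal{A}$ is the set of integers 1, ..., n_max, that $\mathfrak{p}$ is the set of all primes, that $\mathcal{A}_q$ consists of the multiples of q in $\mathcal{A}$, and that $\omega(p) = 1$; the function names are hypothetical.

```python
from fractions import Fraction


def primes_below(z: int) -> list:
    """All primes p < z, by trial division (adequate for small z)."""
    return [p for p in range(2, z)
            if all(p % d for d in range(2, int(p ** 0.5) + 1))]


def sift(n_max: int, q: int, z: int) -> int:
    """S(A_q, p, z): multiples of q in {1..n_max} with no prime factor < z."""
    ps = primes_below(z)
    return sum(1 for a in range(q, n_max + 1, q) if all(a % p for p in ps))


def buchstab_rhs(n_max: int, q: int, z1: int, z: int) -> int:
    """Right-hand side of (11.32)."""
    tail = sum(sift(n_max, q * p, p) for p in primes_below(z) if p >= z1)
    return sift(n_max, q, z1) - tail


def W(z: int) -> Fraction:
    """W(z) as the product over p < z of (1 - 1/p), i.e. omega(p) = 1."""
    w = Fraction(1)
    for p in primes_below(z):
        w *= 1 - Fraction(1, p)
    return w


def W_rhs(z1: int, z: int) -> Fraction:
    """Right-hand side of (11.33) with omega(p) = 1."""
    return W(z1) - sum(Fraction(1, p) * W(p)
                       for p in primes_below(z) if p >= z1)


# q = 1 and q = 31 both satisfy (q, P(30)) = 1.
for q in (1, 31):
    for z1, z in ((2, 30), (5, 30), (7, 20)):
        assert sift(5000, q, z) == buchstab_rhs(5000, q, z1, z)
        assert W(z) == W_rhs(z1, z)
```

Exact rational arithmetic is used for W so that (11.33) is tested as an identity rather than up to rounding.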

To see how (11.32) links up the problem of obtaining a lower
bound for sifting functions to that of having good upper estimates,
suppose that one has a lower bound for (the larger) $S(\mathcal{A}_q, \mathfrak{p}, z_1)$. Then
upper bounds for the $S(\mathcal{A}_{qp}, \mathfrak{p}, p)$'s enable us to obtain a lower bound for (the
smaller) $S(\mathcal{A}_q, \mathfrak{p}, z)$. (We also have a similar remark for the problem of
obtaining upper bounds by means of (11.32).) However, the significant
part of (11.32) is its iterative aspect, consisting of applying (11.32) to (some
of) the S-functions with (the respective) p's in place of z, thereby
obtaining more terms of both signs, which in turn (after 'many' iterations)
would provide a more effective scheme for the above procedure. Actually,
it is again because of this iterative process that (11.32) is stated
with a (general) q (rather than with q = 1).

Since, to start with, we do not have a general lower bound for
$S(\mathcal{A}_q, \mathfrak{p}, z_1)$, one can only make the choice

$$ z_1 = 2 \tag{11.38} $$

so that, by (I97T31 . 

$$ S(\mathcal{A}_q, \mathfrak{p}, z_1) = |\mathcal{A}_q| = \frac{\omega(q)}{q}\,X + R_q. \tag{11.39} $$

Then, on multiplying (11.33) by $\frac{\omega(q)}{q}X$ and subtracting the result
from (11.32), we are led to deal with the remainders $R_{qd}$'s only. Now, in
order not to accumulate too many terms from the upper estimate for the
sum in (11.32), we use Theorem 11.5 for $S(\mathcal{A}_{qp}, \mathfrak{p}, p)$ with $\xi^2$ replaced
by $\xi^2/p$ (and also with p instead of z, which ensures (Q) for qp in place
of q). Thus we arrive at a first-step lower bound corresponding to the
upper bound of Theorem 11.6, and so we state this result in the form of an
asymptotic equality:

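Theorem 11.7. $(\Omega_1)$, $(\Omega_2(\kappa))$, $(Q)$: For

$$ \tau := \frac{\log \xi^2}{\log z} \ge 2, \tag{11.40} $$

there holds

$$ S(\mathcal{A}_q, \mathfrak{p}, z) = \frac{\omega(q)}{q}\,X W(z)\Big\{1 + O\Big(\exp\Big\{-\frac{\tau}{2}\Big(\log\frac{\tau}{2} + 2\Big)\Big\}\Big)\Big\} + \theta\sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}|, \quad |\theta| \le 1. \tag{11.41} $$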

It should be mentioned here that, with q = 1 and a suitable choice
for $\xi$, one easily obtains from Theorem 11.7, under the condition (R), the
so-called 'Fundamental Lemma' (cf. Kubiliyus, Lemma 1.4):
Theorem 11.8. $(\Omega_1)$, $(\Omega_2(\kappa))$, $(R)$: For

$$ u := \frac{\log X}{\log z} \ge 1, \tag{11.42} $$

there hold

$$ S(\mathcal{A}, \mathfrak{p}, z) = X W(z)\big\{1 + O\big(e^{-\frac12 u\log u}\big) + O\big(e^{-\sqrt{\log X}}\big)\big\} \tag{11.43} $$

and

$$ S(\mathcal{A}, \mathfrak{p}, z) = X W(z)\big\{1 + O\big(e^{-\frac12 u}\big)\big\}. \tag{11.44} $$

Observe that the preceding results (i.e., Theorems 11.6, 11.7 and
11.8) are significant only if u (or $\tau$) is large (which means that z should
be small in comparison with X (or $\xi$)). The reason for this limitation
is, apart from the weak condition $(\Omega_2(\kappa))$, mainly the fact that
Lemma 11.1 has been used with a 'trivial' choice for $z_1$ in deriving the
lower estimate. However, using the stronger $(\Omega_2(\kappa, L))$ instead of $(\Omega_2(\kappa))$
we obtain more precise information about $G(\xi, z)$. And still more
important is that Theorem 11.7 enables us to employ Lemma 11.1 with a
'non-trivial' choice for $z_1$ (cf. (11.52)).

From this point onwards we shall confine ourselves to the case of
dimension

$$ \kappa = 1, \tag{11.45} $$

and we owe some explanation for doing so. First of all, the upper
estimates for $G(\xi, z)$ under $(\Omega_2(\kappa, L))$ in the case of general dimension $\kappa$ (and
then also for $S(\mathcal{A}, \mathfrak{p}, z)$ via Theorem 11.5) become quite complicated
and further, they do not yield satisfactory results
(apart from the particular cases of $\kappa = 1$ and $\kappa = \tfrac12$), when applied for
obtaining lower bounds by means of Lemma 11.1. Also, when one is
interested solely in finding upper estimates (in the most interesting
questions) the generalized Theorem 11.5 has no advantage over (the simple)
Theorem 9.1. Finally, we have that most of the prominent problems in
prime number theory which can be attacked by Selberg's sieve method
are of dimension 1.

Now on imposing $(\Omega_2(1, L))$ one obtains
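Lemma 11.2. $(\Omega_1)$, $(\Omega_2(1, L))$: Let

$$ \tau := \frac{\log \xi^2}{\log z} > 0. \tag{11.46} $$

Then there holds

$$ \frac{1}{G(\xi, z)} = W(z)\Big\{F_0(\tau) + O\Big(\frac{L}{\log z}\,(\tau^3 + \tau^{-2})\Big)\Big\}, \tag{11.47} $$

where $F_0(\tau)$ is defined by

$$ F_0(\tau) = \frac{2e^{\gamma}}{\tau} \quad \text{for } 0 < \tau \le 2, \tag{11.48} $$

and by the differential-difference equation

$$ \Big(\frac{1}{\tau F_0(\tau)}\Big)' = -\frac{1}{\tau^2 F_0(\tau - 2)} \quad \text{for } \tau \ge 2. \tag{11.49} $$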

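If we apply Lemma 11.2 in Theorem 11.5 we obtain, corresponding
to Theorem 11.6, the following

Theorem 11.9. $(\Omega_1)$, $(\Omega_2(1, L))$, $(Q)$: For any real number $\xi > 1$ and

$$ \tau := \frac{\log \xi^2}{\log z} \quad (> 0), \tag{11.50} $$

there holds

$$ S(\mathcal{A}_q, \mathfrak{p}, z) \le \frac{\omega(q)}{q}\,X W(z)\Big\{F_0(\tau) + O\Big(\frac{L}{\log z}\,(\tau^3 + \tau^{-2})\Big)\Big\} + \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}|, \tag{11.51} $$

where $F_0(\tau)$ is defined by (11.48) and (11.49).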

We can now repeat the above Buchstab procedure (cf. the description
preceding Theorem 11.7) to obtain a general non-trivial lower bound for
our sifting functions. To this end we choose in (11.32)

$$ z_1 = \exp\Big\{\frac{\log \xi}{\log\log \xi}\Big\} \tag{11.52} $$

(and also assume $\xi$ to be sufficiently large; indeed, we can even suppose
that $\xi > z$ since the result below (Theorem 11.10) is otherwise trivial
because of (11.59)). This choice enables us to employ Theorem 11.7
for a lower estimate of $S(\mathcal{A}_q, \mathfrak{p}, z_1)$, and for the remaining part of the
right-hand side in (11.32) we can now use Theorem 11.9 instead, but
again with

$$ \xi^2 \text{ replaced by } \frac{\xi^2}{p} \quad \text{and} \quad z \text{ replaced by } p \tag{11.53} $$

for the same reasons as before. However, here we encounter an
additional difficulty due to the presence of the factors
$F_0\big(\frac{\log(\xi^2/p)}{\log p}\big)$ (instead of
1 in the previous case) stemming from our use of Theorem 11.9 with
(11.53). This difficulty is overcome by deriving from (11.33) the
following

Lemma 11.3. $(\Omega_1)$, $(\Omega_2(1, L))$: Suppose that

$$ 2 \le z_1 \le z \le \xi, \tag{11.54} $$

and let $\psi(t)$ be a non-negative, monotonic and continuous function for
$t \ge 1$. Further, define

$$ M := \max_{z_1 \le w \le z} \psi\Big(\frac{\log(\xi^2/w)}{\log w}\Big). \tag{11.55} $$

Then there holds

$$ \sum_{z_1 \le p < z} \frac{\omega(p)}{p}\,W(p)\,\psi\Big(\frac{\log(\xi^2/p)}{\log p}\Big) = W(z)\,\frac{\log z}{\log \xi^2}\int_{\log \xi^2/\log z}^{\log \xi^2/\log z_1} \psi(t-1)\,dt + O\Big(\frac{L M W(z)\log z}{\log^2 z_1}\Big). \tag{11.56} $$

Thus using Lemma 11.3 (instead of (11.33)) along with (11.32) one
obtains, by the above procedure, without any more difficulty the required
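Theorem 11.10. $(\Omega_1)$, $(\Omega_2(1, L))$, $(Q)$: For any real number $\xi > 1$ and

$$ \tau := \frac{\log \xi^2}{\log z} \quad (> 0), \tag{11.57} $$

there holds

$$ S(\mathcal{A}_q, \mathfrak{p}, z) \ge \frac{\omega(q)}{q}\,X W(z)\Big\{f_0(\tau) + O\Big(\frac{L(\log\log 3\xi)^5}{\log \xi}\Big)\Big\} - \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}|, \tag{11.58} $$

where $f_0(\tau)$ is defined by

$$ f_0(\tau) = 0 \quad \text{for } 0 < \tau \le \nu_0 = 2.06\ldots, \tag{11.59} $$

and by, with the function $F_0$ of Lemma 11.2,

$$ (\tau f_0(\tau))' = F_0(\tau - 1) \quad \text{for } \tau \ge \nu_0. \tag{11.60} $$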

Now, before starting the iteration of the Buchstab procedure (cf. the
remarks made subsequent to the proof of Lemma 11.1), we make the
result of Theorem 11.10 explicit (for q = 1) by imposing the following
condition (about the $R_d$'s):

Suppose that there are constants

$$ 0 < \alpha \le 1, \quad A_3\ (\ge 1), \quad A_4\ (\ge 1) \tag{11.61} $$

such that

$$ (R(1, \alpha)) \qquad \sum_{\substack{d < X^{\alpha}/\log^{A_3} X \\ (d, \bar{\mathfrak{p}}) = 1}} \mu^2(d)\,3^{\nu(d)}\,|R_d| \le A_4\,\frac{X}{\log^2 X} \quad \text{for } X \ge 2. \tag{11.62} $$

Then one gets from Theorem 11.10, in the case q = 1, on making the
choice

$$ \xi^2 = \frac{X^{\alpha}}{\log^{A_3} X} \tag{11.63} $$

and noting that $d \mid P(z)$ implies that $\mu(d) \ne 0$ and $(d, \bar{\mathfrak{p}}) = 1$,
Theorem 11.11. $(\Omega_1)$, $(\Omega_2(1, L))$, $(R(1, \alpha))$:

$$ S(\mathcal{A}, \mathfrak{p}, z) \ge X W(z)\Big\{f_0\Big(\alpha\,\frac{\log X}{\log z}\Big) + O\Big(\frac{L(\log\log 3X)^5}{\log X}\Big)\Big\}, \tag{11.64} $$

where $f_0(\tau)$ is defined by (11.59) and (11.60).

Returning now to the remarkable iterative aspect of the Buchstab
procedure, we see that on using Theorem 11.9 with an appropriate
choice of $z_1$, one has an upper bound for the first term on the right-hand
side in (11.32), and also that using Theorem 11.10, with the
replacements mentioned in (11.53), in combination with Lemma 11.3 (for $f_0$
in place of $\psi$), there follows a lower bound for the sum in (11.32). Thus
we arrive at another form of Theorem 11.9 where $F_0(\tau)$ is replaced by
some other (similar) function $F_1(\tau)$, and this in turn also yields another
form of Theorem 11.10 where $f_0(\tau)$ is replaced by an $f_1(\tau)$ (related to
$F_1(\tau)$).

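Continuing this procedure we are led to results of the type of Theorems
11.9 and 11.10 with (at the $\mu$-th step) the following pair of functions
(instead of $F_0(\tau)$, $f_0(\tau)$ respectively)

$$ F_\mu(\tau),\ f_\mu(\tau), \quad \mu = 0, 1, 2, \ldots, \tag{11.65} $$

where (analogous to the first step as indicated in (11.88))

$$ f_\mu(\tau) = 0 \ \text{ for } \tau \le \nu_\mu, \qquad f_\mu(\tau) = 1 - \frac{1}{\tau}\int_\tau^\infty (F_\mu(t-1) - 1)\,dt \ \text{ for } \tau \ge \nu_\mu, \tag{11.66} $$

with the number $\nu_\mu$ defined by

$$ \frac{1}{\nu_\mu}\int_{\nu_\mu}^\infty (F_\mu(t-1) - 1)\,dt = 1 \tag{11.67} $$

(cf. the remark following (11.88)) and similarly

$$ F_{\mu+1}(\tau) := \begin{cases} F_0(\tau) & \text{for } \tau \le \nu'_\mu, \\ 1 - \dfrac{1}{\tau}\displaystyle\int_\tau^\infty (f_\mu(t-1) - 1)\,dt & \text{for } \tau \ge \nu'_\mu, \end{cases} \tag{11.68} $$

with suitably chosen numbers $\nu'_\mu$ for $\mu = 0, 1, 2, \ldots$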

The power of this procedure is demonstrated by the surprising fact
that, at each step, the quality of the respective forms of Theorems
11.9 and 11.10 improves. Further, the sequence of numbers $\{\nu_\mu\}$
converges to 2 from above, as also does $\{\nu'_\mu\}$, and the pair of functions
$\{F_\mu, f_\mu\}$ converges to a pair of limit functions $\{F, f\}$, as $\mu \to \infty$:

$$ \lim_{\mu\to\infty} \nu_\mu = 2 = \lim_{\mu\to\infty} \nu'_\mu, \quad \lim_{\mu\to\infty} F_\mu(\tau) = F(\tau), \quad \lim_{\mu\to\infty} f_\mu(\tau) = f(\tau). \tag{11.69} $$
Now, from flT59l . (11 1 .66b . (QTjH) and (II 1.59t . we find that 

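$$ F(\tau) = \frac{2e^{\gamma}}{\tau}, \quad f(\tau) = 0 \quad \text{for } 0 < \tau \le 2 \tag{11.70} $$

and

$$ (\tau F(\tau))' = f(\tau - 1) \quad \text{and} \quad (\tau f(\tau))' = F(\tau - 1), \quad \text{for } \tau \ge 2, \tag{11.71} $$

which gives, on integrating from 2 to u,

$$ uF(u) = 2e^{\gamma} + \int_1^{u-1} f(t)\,dt \quad \text{and} \quad uf(u) = \int_1^{u-1} F(t)\,dt, \quad \text{for } u \ge 2. \tag{11.72} $$

Then we obtain from (11.72), in particular (cf. (11.70)),

$$ F(u) = \frac{2e^{\gamma}}{u} \quad \text{for } 0 < u \le 3 \tag{11.73} $$

and (so) further

$$ f(u) = \frac{2e^{\gamma}}{u}\,\log(u - 1) \quad \text{for } 2 \le u \le 4. \tag{11.74} $$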
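
The closed forms (11.73) and (11.74) can be tested against the differential-difference equations (11.71) numerically. The Python sketch below does this with central finite differences; the function names, the step size `h` and the sample points are illustrative choices, and the functions are implemented only on the ranges where the closed forms are stated.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant
C = 2.0 * math.exp(EULER_GAMMA)   # the constant 2 e^gamma


def F(u: float) -> float:
    """Upper sieve function on 0 < u <= 3, from (11.73)."""
    if not 0.0 < u <= 3.0:
        raise ValueError("closed form used here only for 0 < u <= 3")
    return C / u


def f(u: float) -> float:
    """Lower sieve function on 0 < u <= 4, from (11.70) and (11.74)."""
    if not 0.0 < u <= 4.0:
        raise ValueError("closed form used here only for 0 < u <= 4")
    return 0.0 if u <= 2.0 else C * math.log(u - 1.0) / u


def derivative(g, x: float, h: float = 1e-6) -> float:
    """Central finite-difference approximation to g'(x)."""
    return (g(x + h) - g(x - h)) / (2.0 * h)


# (tau * F(tau))' = f(tau - 1) on 2 < tau < 3: both sides vanish there.
for tau in (2.2, 2.5, 2.8):
    assert abs(derivative(lambda t: t * F(t), tau) - f(tau - 1.0)) < 1e-6

# (tau * f(tau))' = F(tau - 1) on 2 < tau < 4.
for tau in (2.5, 3.0, 3.5):
    assert abs(derivative(lambda t: t * f(t), tau) - F(tau - 1.0)) < 1e-6

print(F(3.0), f(4.0))
```

At the ends of these ranges the values straddle 1, with f(4) below 1 and F(3) above 1, in line with the limits and monotonicity recorded in (11.75).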



Also, we have

$$ F(u) > 0, \quad f(u) \ge 0, \quad F(u)\downarrow, \quad f(u)\uparrow \quad (\text{for } u > 0) \quad \text{and} \quad \lim_{u\to\infty} F(u) = 1 = \lim_{u\to\infty} f(u) \tag{11.75} $$

and

u? - U\ 

< F(ui) - F(u 2 ) < F{5)— -, 

U\ 

< f(u 2 ) - /(mi) < 2e y ll2 ~ Ul , for < 6 < u\ < u 2 , (11.76) 

Actually, when one knows these results (about F, f) there is no need
to consider the sequence of pairs of functions $\{F_\mu, f_\mu\}$ (and so also the
numbers $\nu_\mu$, $\nu'_\mu$). One can instead iterate (11.32) as well as Lemma 11.3
(with $\psi = F$ and $\psi = f$) and apply the Buchstab procedure (as was done,
for instance, to obtain Theorem 11.10) only once. More precisely, in the
iterated version of (11.32), among the various functions S occurring with
different signs, one can apply Theorem 11.7 for those which are within
its domain (of applicability). For those of the remaining S's which are
to be estimated from above one can use Theorem 11.9, while the trivial
lower estimate $S \ge 0$ is used for the rest. It remains only to prove that
this process converges. However, this can be done if the dimension $\kappa$
satisfies

$$ \kappa < \kappa_0, \tag{11.77} $$

where $\kappa_0$ is some constant greater than 1. Therefore we can succeed in
our case (11.45).

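Now we can state the final result of the Buchstab procedure obtained
in the manner mentioned in our previous remark.

Theorem 11.12. $(\Omega_1)$, $(\Omega_2(1, L))$, $(Q)$: For

$$ \xi \ge z, \tag{11.78} $$

there hold

$$ S(\mathcal{A}_q, \mathfrak{p}, z) \le \frac{\omega(q)}{q}\,X W(z)\Big\{F\Big(\frac{\log \xi^2}{\log z}\Big) + O\Big(\frac{L}{(\log \xi)^{1/14}}\Big)\Big\} + \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}| \tag{11.79} $$

and

$$ S(\mathcal{A}_q, \mathfrak{p}, z) \ge \frac{\omega(q)}{q}\,X W(z)\Big\{f\Big(\frac{\log \xi^2}{\log z}\Big) + O\Big(\frac{L}{(\log \xi)^{1/14}}\Big)\Big\} - \sum_{\substack{d < \xi^2 \\ d \mid P(z)}} 3^{\nu(d)}\,|R_{qd}|, \tag{11.80} $$

where the functions F, f are defined by (11.70) and (11.71), and the
O-constants depend at most on $A_1$ and $A_2$.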

This theorem is also true if 

1 <£<zbut «^ (11.81) 

with a positive constant A, in which case the O-constant in (11.79)
depends also on A.

Similar to Theorem 11.11 we can obtain, in the case q = 1, with the
same choice as (11.63), the following final result from Theorem 11.12:

Theorem 11.13. $(\Omega_1)$, $(\Omega_2(1, L))$, $(R(1, \alpha))$: For

$$ z \le X, \tag{11.82} $$

we have

$$ S(\mathcal{A}, \mathfrak{p}, z) \le X W(z)\Big\{F\Big(\alpha\,\frac{\log X}{\log z}\Big) + O\Big(\frac{L}{(\log X)^{1/14}}\Big)\Big\}, \tag{11.83} $$

$$ S(\mathcal{A}, \mathfrak{p}, z) \ge X W(z)\Big\{f\Big(\alpha\,\frac{\log X}{\log z}\Big) + O\Big(\frac{L}{(\log X)^{1/14}}\Big)\Big\}, \tag{11.84} $$

where the functions F, f are defined by (11.70) and (11.71), and the
O-constants depend at most on $A_i$, i = 1, 2, 3, 4, and $\alpha$.
Although the functions F, f are invariant under the Buchstab
procedure, in view of the fact that

$$ f(u) < F(u) \quad \forall u > 0 \tag{11.85} $$

and the procedure described preceding (11.77), it is natural to have some
doubt as to whether the quality of our final results (viz. Theorems
11.12 and 11.13) cannot be further improved. However, it can be shown
that, for the sets

$$ \mathcal{A} = \mathcal{B}_\nu := \{n : 1 \le n \le x,\ \Omega(n) \equiv \nu \bmod 2\},\ \nu = 1, 2 \quad \text{and} \quad \mathfrak{p} = \mathfrak{p}_1 \tag{11.86} $$
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(cf. $n$l ). the relations (111.83b and dl 1.84b hold with equality signs 
(up to the leading term) for $\nu = 1$ and $\nu = 2$ respectively (and for all
values of $u := \frac{\log X}{\log z} > 0$ in both cases). (Actually, here we take $X = \frac{x}{2}$,
$\omega(p) = 1$ and slightly modify $R(1, \alpha)$, with $\alpha = 1$, via Theorem 11.12.)
It is in this sense that the final form, as stated in Theorem 11.13, of the
(proper) Selberg sieve is best possible.



NOTES 



The survey of Selberg's sieve given in this chapter follows the
approach of Halberstam and Richert (T) and we refer to (Chapters |3J 
121 16] Q EI of) this book for the content of this chapter as well as for 
the applications of these results. So all the references below (unless
otherwise explicitly stated) are referred to by 'l.c.'.

(fTT3l : cf. LemmalO 

(11.6): cf. Lemma 5.4.

Theorem 11.1: cf. l.c. Theorem 4.1.

Theorem 11.2: cf. l.c. Theorem 5.2 (see also under (11.12) below).

Theorem 11.3: cf. l.c. Theorem 2.2. Actually, Theorem 11.3 holds
for $z > X^A$ also when the restriction in the product, $p < z$, is replaced by
$p < X$.

Theorem 11.4: cf. l.c. Theorem 5.1.

(11.12): Actually, in the O-term of (11.12) (and also of Theorem
11.2) L can be replaced by min(L, log z), so that Theorem 11.4 (and
respectively Theorem 11.2) would include Theorem 11.3 (and
respectively Theorem 11.1). This is, however, not surprising because of the
fact that the condition $(\Omega_2(\kappa, L))$ includes $(\Omega_2(\kappa))$.

(111.13b : For more details concerning the description here, in con- 
nection with 

(111.13b : see l.c Chapter EJ 5. 

Theorem 11.5: cf. l.c. Theorem 6.1.

Theorem 11.6: cf. l.c. Theorem 6.2.

Lemma 11.1: cf. l.c. Lemma 7.1.

Theorem 11.7: cf. l.c. Theorem 7.1.

Theorem 11.8: cf. l.c. Theorem 7.2. Also cf. l.c. Theorem 2.5 for a
slightly stronger result derived from Brun's sieve.

Lemma 11.2: cf. l.c. Lemma 6.1 and also l.c. (4.18) on p. 201.

(11.48), (11.49): For the function $F_0$ defined here, it can be proved
that

$$ F_0(\tau) > 0, \quad F_0(\tau)\downarrow \ (\text{for } \tau > 0) \quad \text{and} \quad \lim_{\tau\to\infty} F_0(\tau) = 1. \tag{11.87} $$

For more details about Fo(t), cf. l.c. Chapter^ 3 (where this func- 
1 

tion occurs as ). 

Theorem 11.9: cf. l.c. Theorem 6.3.

Lemma 11.3: cf. l.c. Theorem 7.2.

Theorem 11.10: cf. l.c. Theorem 7.3.

(11.59), (11.60): To form an idea of the introduction of this function
$f_0(\tau)$ it is useful to observe that, by the procedure leading to Theorem
11.10 here, the contributions to the leading term (apart from the factor
$\frac{\omega(q)}{q}XW(z)$) are, in view of Lemma 11.3 here (with $\psi(t) = F_0(t)$) and
Theorem 11.9 here,

$$ 1 - \frac{1}{\tau}\int_\tau^\infty (F_0(t-1) - 1)\,dt =: f_0(\tau). \tag{11.88} $$

This gives (11.60), and the choice of (11.59) is made since $f_0(\tau)$ is
negative if $\tau < \nu_0$ (by (11.88)), where $\nu_0$ is defined by

$$ \frac{1}{\nu_0}\int_{\nu_0}^\infty (F_0(t-1) - 1)\,dt = 1, \tag{11.89} $$

while one has always, on the other hand, the trivial $S(\mathcal{A}_q, \mathfrak{p}, z) \ge 0$. Now
(11.89) yields

$$ \nu_0 = 2.06\ldots $$

Further, one can show, corresponding to (11.87) above,

$$ f_0(\tau) \ge 0, \quad f_0(\tau)\uparrow \ (\text{for } \tau > 0) \quad \text{and} \quad \lim_{\tau\to\infty} f_0(\tau) = 1. \tag{11.90} $$



(11.62): Clearly, this condition has been modelled such that
Bombieri-type theorems are directly applicable.

Theorem 11.11: cf. l.c. Theorem 7.4.

(11.70), ..., (11.75): Jurkat and Richert [1] (cf. l.c. (Chapter 8, § 2)).

Theorem 11.12: cf. l.c. Theorem 8.3.

Theorem 11.13: Jurkat and Richert [1], Halberstam, Jurkat and
Richert [1] (cf. l.c. (Theorem 8.4)). Selberg [5] has pointed out that
(according to an unpublished paper of J.B. Rosser) this result can also
be derived from Brun's sieve. This has been proved, even with a better
error term (namely, with $\frac{1}{14}$ replaced by 1), by Iwaniec [1]. Iwaniec
[1] has also applied this improved version of Theorem 11.13 to sharpen
the bounds for the Legendre-Jacobsthal function $C_0(r)$, the maximum
length of a block of consecutive integers each of which is divisible by at
least one of the first r primes. His result is

$$ C_0(r) \ll r^2\log^2 r, $$

whereas (our) Theorem 11.13 leads only to the estimate

$$ C_0(r) \ll r^2\exp\{(\log r)^{13/14}\}. $$
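
For small r the function $C_0(r)$ can be computed directly from its definition. The Python sketch below does so by scanning one full period of the covering pattern; the names `first_primes` and `longest_covered_block` are illustrative, and the brute-force scan is practical only for the first few values of r, since the period is the product of the first r primes.

```python
import math


def first_primes(r: int) -> list:
    """Return the first r primes by trial division against earlier primes."""
    primes, n = [], 2
    while len(primes) < r:
        if all(n % p for p in primes):
            primes.append(n)
        n += 1
    return primes


def longest_covered_block(r: int) -> int:
    """Maximum length of a run of consecutive integers, each divisible by at
    least one of the first r primes. The pattern is periodic with period equal
    to the product of these primes, and 1 and period + 1 are both uncovered,
    so scanning 1, ..., period + 1 sees every run in full."""
    primes = first_primes(r)
    period = math.prod(primes)
    best = run = 0
    for n in range(1, period + 2):
        if any(n % p == 0 for p in primes):
            run += 1
            best = max(best, run)
        else:
            run = 0
    return best


print([longest_covered_block(r) for r in range(1, 6)])
```

For example, with r = 4 the integers 2, 3, ..., 10 are all divisible by one of 2, 3, 5, 7, which gives a block of length 9.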

In the opposite direction we have, by Rankin,

$$ C_0(r) \ge (e^{\gamma} - \varepsilon)\,\frac{r\log^2 r\,\log\log\log r}{(\log\log r)^2} $$

(cf. I.e. (p. 239)). (For the case k = - see Iwaniec 0, 0, and 

for k < - see Iwaniec [7j. However, for k exceeding some constant 
$k_1$ (> 1) Selberg's sieve seems to be always superior to Brun's sieve
(for instance, in sifting values of reducible polynomials).
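The Legendre-Jacobsthal function defined above can be computed directly for very small $r$, which gives a feel for the quantity that the bounds of Iwaniec and Rankin control. The following is only a brute-force sketch; the function names `first_primes` and `legendre_jacobsthal` are illustrative and do not come from the lectures.

```python
def first_primes(r):
    """Return the first r primes by trial division (adequate for small r)."""
    primes = []
    n = 2
    while len(primes) < r:
        if all(n % p for p in primes):
            primes.append(n)
        n += 1
    return primes


def legendre_jacobsthal(r):
    """Brute-force C_0(r): the longest block of consecutive integers each of
    which is divisible by at least one of the first r primes."""
    primes = first_primes(r)
    period = 1
    for p in primes:
        period *= p
    best = run = 0
    # Divisibility by the first r primes is periodic modulo their product,
    # and n = 1 (mod period) is never hit, so one period suffices.
    for n in range(1, period + 1):
        if any(n % p == 0 for p in primes):
            run += 1
            best = max(best, run)
        else:
            run = 0
    return best
```

For $r = 1, \dots, 5$ this returns 1, 3, 5, 9, 13; the cost grows like the product of the first $r$ primes, so the sketch is useful only as an illustration of the definition.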

(11.86): This fact (about (11.86)) was established by Selberg [5] (for
$0 < u < 2$), and he added the remark that the sieve method "cannot
distinguish between numbers with an odd or an even number of prime
factors".



Chapter 12 

Weighted Sieves 



AS ALREADY mentioned in the previous chapter, our object is to use
the 'final' results there for the proofs of the results of the next chapter.
For this purpose we need to improve further the quality of these results,
in spite of the concluding remarks of Chapter 11, and this is achieved
by considering a weighted sieve, which actually consists of a combination
of various sifting functions (so that the counter-examples (11.86)
collapse).

Before commencing the introduction of weighted sieves for the purpose
mentioned above, we briefly point out how the final result of
Chapter 11, in particular (11.84), already yields a result in the direction
of Theorem 13.2. For this, let us consider

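\[ \mathcal{A} := \{p + 2 : p \le x\}, \qquad \mathfrak{P} := \mathfrak{P}_2 \tag{12.1} \]

(cf. (9.39)). Here we can take

\[ X = \operatorname{li} x, \qquad \omega(p) = \frac{p}{p-1} \ \text{ for } p \in \mathfrak{P} \ (\text{i.e., for } p > 2) \tag{12.2} \]

and then the conditions $(\Omega_1)$, $(\Omega_2(1, L))$ are verified easily for some fixed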
constants A\, A2 and L(> 1). Further, Bombieri's theorem (Lemma 
[TO enables us to fulfill (8(1, a)) with 

\[ \alpha = \frac{1}{2} \tag{12.3} \]
(for some suitably chosen absolute constants $A_3$ and $A_4$). Also we find,
by EH, 

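\[ W(z) = \prod_{2<p<z}\Bigl(1 - \frac{1}{p-1}\Bigr) = 2\prod_{p<z}\Bigl(1 - \frac{1}{p}\Bigr)\prod_{2<p<z}\Bigl(1 - \frac{1}{(p-1)^2}\Bigr) = 2\prod_{p>2}\Bigl(1 - \frac{1}{(p-1)^2}\Bigr)\frac{e^{-\gamma}}{\log z}\Bigl(1 + O\Bigl(\frac{1}{\log z}\Bigr)\Bigr). \tag{12.4} \]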

Therefore, taking

\[ z = x^{1/u}, \quad u > 4; \quad u = 4.2 \tag{12.5} \]

say, we obtain from (11.84) and (11.74) that, for some positive constant $c_0$,

\[ S(\mathcal{A}, \mathfrak{P}, z) \ge c_0 \frac{x}{\log^2 x} \quad \text{for } x \ge x_0. \tag{12.6} \]

Now, note that the numbers counted on the left-hand side here have
no prime divisor $< z$, and so for each of these numbers we have

\[ x + 2 \ge p + 2 = p_1 \cdots p_r \ge x^{r/u}, \tag{12.7} \]

which shows that necessarily $r \le u$ or, by (12.5), that $r \le 4$. Thus, letting
$x \to \infty$, we have that for infinitely many primes $p$

\[ p + 2 = P_4. \tag{12.8} \]

At this point it is worthwhile to notice that Theorem 11.11 (instead
of (11.84)) would have also led to (12.6), though with a smaller value
for $c_0$, and so also to (12.8). We can express this remark by saying
that (12.8) follows from Selberg's upper bound sieve combined with
the (one-step) Buchstab procedure on using Bombieri's prime number
theorem (cf. also the remark involving (13.21), with respect to Theorem
13.2 below).
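The counting step behind (12.7) can be checked numerically: once $p + 2$ has no prime divisor below $z = x^{1/u}$ with $u = 4.2$, it can have at most four prime factors. The sketch below only illustrates that elementary step (it does not reproduce the lower bound (12.6)); the helper names and the choice $x = 10^4$ are assumptions made for the illustration.

```python
def primes_up_to(n):
    """Sieve of Eratosthenes returning all primes <= n."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(n + 1) if sieve[i]]


def prime_factors(m):
    """Prime factors of m >= 2 with multiplicity, by trial division."""
    factors, d = [], 2
    while d * d <= m:
        while m % d == 0:
            factors.append(d)
            m //= d
        d += 1
    if m > 1:
        factors.append(m)
    return factors


def sifted_shifted_primes(x, u=4.2):
    """Pairs (p, r): primes p <= x such that p + 2 has no prime divisor
    below z = x**(1/u), with r the number of prime factors of p + 2."""
    z = x ** (1.0 / u)
    result = []
    for p in primes_up_to(x):
        factors = prime_factors(p + 2)
        if min(factors) >= z:
            result.append((p, len(factors)))
    return result
```

Calling `sifted_shifted_primes(10**4)` returns only pairs with `r <= 4`, in line with (12.7); the sieve theorems are needed to show that the list keeps growing like $x / \log^2 x$.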

Now we turn to the weighted sieves. The first weighted sieve, suitable
for our purpose also, was introduced by Kuhn (in connection with
Brun's sieve). Since then other, more sophisticated sieves have been
invented. However, we shall need only a special form of the
simplest of these, namely Kuhn's sieve (cf. Theorem 12.1).
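Theorem 12.1. $(R(1, \alpha))$: Let $h$ be an even integer (determined with
respect to $x$) satisfying

\[ 0 < |h| \le x \tag{12.9} \]

and suppose that (associated with a sequence $\mathcal{A}$) we have

\[ X = \operatorname{li} x, \qquad \omega(p) = \frac{p}{p-1} \ \text{ for } p \in \mathfrak{P}_h. \tag{12.10} \]

Let $u$ and $v$ be two real numbers (independent of $x$) such that

\[ \frac{1}{\alpha} < u < v. \tag{12.11} \]

Define

\[ W(\mathcal{A}; \mathfrak{P}_h, v, u, \lambda) := \sum_{\substack{a \in \mathcal{A} \\ (a, P(x^{1/v})) = 1}} \Bigl\{1 - \frac{1}{\lambda} \sum_{\substack{x^{1/v} \le p < x^{1/u} \\ p \mid a,\ p \in \mathfrak{P}_h}} 1\Bigr\}, \tag{12.12} \]

\[ 2 \le \lambda \in \mathbb{R}. \tag{12.13} \]

Then

\[ W(\mathcal{A}; \mathfrak{P}_h, v, u, \lambda) \ge e^{-\gamma}\, \mathfrak{S}(h)\, \frac{x v}{\log^2 x} \Bigl\{ f(\alpha v) - \frac{1}{\lambda} \int_u^v F\Bigl(v\Bigl(\alpha - \frac{1}{t}\Bigr)\Bigr) \frac{dt}{t} + O\Bigl(\frac{1}{(\log x)^{1/15}}\Bigr) \Bigr\}, \tag{12.14} \]

where $\mathfrak{S}$ is defined by (9.42) and the $O$-constant depends at most on
$u$, $v$, $A_3$, $A_4$ and $\alpha$.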

Proof. We may assume that

\[ x \ge x_0(u, v, A_3, A_4, \alpha). \tag{12.15} \]

Let us set

\[ z := x^{1/v}, \qquad y := x^{1/u}. \tag{12.16} \]

Then, by (12.12),

\[ W(\mathcal{A}; \mathfrak{P}_h, v, u, \lambda) = S(\mathcal{A}, \mathfrak{P}_h, z) - \frac{1}{\lambda} \sum_{\substack{z \le p < y \\ p \in \mathfrak{P}_h}} S(\mathcal{A}_p, \mathfrak{P}_h, z) = S(\mathcal{A}, \mathfrak{P}_h, z) - \frac{1}{\lambda} \Sigma_1, \tag{12.17} \]

say. Since $2 \mid h$, by (12.10), the condition $(\Omega_1)$ is satisfied with an absolute
constant $A_1$. Also $(\Omega_2(1, L))$ is fulfilled with some absolute constant
$A_2$ and, by (12.9), with

\[ L \le O(1) + \sum_{p \mid h} \frac{\log p}{p} \ll \log\log 3|h| \ll \log\log x. \tag{12.18} \]

Hence we get, by (11.84) (since $v > 1$ by (12.11)),

S(£f,p k ,Z) > XW(z){f( a v( 1 ^^)) + 0( 1 .,.- )}■ (12.19) 

log* (log*) 1 ' 15 

We apply Theorem 111.121 for the estimation of £ i , and for this we 
define (in terms of the constants from (R(l, or))) 

f = t ■ (12.20) 

\og A ^ a X 

Note that for each p in the range of £ i we have 

— > j (12.21) 

P \og A ^ a X 

Therefore applying dll.79t . for each term in 2i> with £ 2 /pm place 
of £ 2 , p = fa (and q = p so that (Q) is satisfied because of p > z and 
p e p/,) we obtain 

I> Z ^^w^( !2 ^) + o(r4j)} + Z ^)3^i* d i. 

i t^r P I ^ log 1/15 J ^ 

(d,p)=l 

(12.22) 

in view of d!2.21b . (I12.1U . dl 1.811 and d!2.18t . Estimating the last sum 
in dTO2l by means of (#(l,a)) (cf. dTT62l ). tiTTM and (IT2~T0l one 

gets 

i U^r^ logz (log*) 1 / 15 j io g 3 x 

(12.23) 
after some simple considerations involving (12.10), (11.73), (11.75),
<fTl~75]) . d!2l6t and (fTtT22l 

Now, from d 12. 101) . ([TTHfl i and (TITToT i. it follows that (cf, dTT^l ) 

nl e' y log log x 

(1 r) = : S(/i) + Q( ° * ) (12.24) 

2< P < Z P ~ l l0 § Z lo § * 

pxh 

where S is defined by (I9.42t and (so) satisfies, (cf. d!2.9t ). 

G(/z) = 0(log log 3\h\) = 0(log log jc). (12.25) 

Using (112.241 in both (112.1 9t and d!2.23t we are led to (by (112. 17t 
and (fT2~T3l ) 

WCef : p*,v,M) > — ^-<T r C(/i)v 
log 2 jc 

f 1 1 log(x o/p ) 1 ) 
/(«v)-- V -F(-f >-v) + 0( r ) (12.26) 

( A ztf<yP lo § x (logx)i5 ] 

by means of (11.76) and (12.20). It remains only to deal with the sum
in (12.26), and for this we proceed as in (10.24), obtaining thereby

> , -^(v(- ) - F(v- )— + 0{- ) 

r i dt i 

= F(v(a --))- + 0(- ). (12.27) 

Ju t t logX 



Using this in (12.26) yields (12.14), and so the proof is completed.

□ 



Regarding the terms containing the functions $f$ and $F$, it turns out that
for a suitable range of $v$ one can have instead an expression involving
only elementary functions (cf. (11.73) and (11.74)). More precisely,
we have
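Lemma 12.1. Let

\[ \frac{1}{\alpha} < u < v, \qquad \frac{2}{\alpha} \le v \le \frac{4}{\alpha} \quad (\text{for } 0 < \alpha \le 1). \tag{12.28} \]

Then (for $2 \le \lambda \in \mathbb{R}$)

\[ f(\alpha v) - \frac{1}{\lambda} \int_u^v F\Bigl(v\Bigl(\alpha - \frac{1}{t}\Bigr)\Bigr) \frac{dt}{t} = \frac{2e^{\gamma}}{\alpha v} \Bigl\{ \log(\alpha v - 1) - \frac{1}{\lambda} \log \frac{v - \frac{1}{\alpha}}{u - \frac{1}{\alpha}} \Bigr\}. \tag{12.29} \]

Proof. Note that the arguments of the functions $f$ and $F$ in (12.29) satisfy,
by (12.28),

\[ 2 \le \alpha v \le 4 \quad \text{and} \quad 0 < v\Bigl(\alpha - \frac{1}{u}\Bigr) \le v\Bigl(\alpha - \frac{1}{v}\Bigr) = \alpha v - 1 \le 3. \tag{12.30} \]

Hence, by (11.73) and (11.74), one has that the left-hand side of
(12.29) equals

\[ \frac{2e^{\gamma}}{\alpha v} \Bigl\{ \log(\alpha v - 1) - \frac{1}{\lambda} \int_u^v \frac{dt}{t - \frac{1}{\alpha}} \Bigr\}. \tag{12.31} \]

This proves (12.29). □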
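The identity (12.29) can be sanity-checked numerically. The sketch below assumes that (11.73) and (11.74) are the usual elementary expressions for the linear sieve, $F(s) = 2e^{\gamma}/s$ for $0 < s \le 3$ and $f(s) = 2e^{\gamma}\log(s-1)/s$ for $2 \le s \le 4$ (these formulas are not displayed in this section, so they are an assumption here), and evaluates the integral with Simpson's rule. All function names are illustrative.

```python
import math

EULER_GAMMA = 0.5772156649015329


def F(s):
    """Upper sieve function on 0 < s <= 3 (assumed form of (11.73))."""
    return 2.0 * math.exp(EULER_GAMMA) / s


def f(s):
    """Lower sieve function on 2 <= s <= 4 (assumed form of (11.74))."""
    return 2.0 * math.exp(EULER_GAMMA) * math.log(s - 1.0) / s


def lhs(alpha, u, v, lam, steps=2000):
    """Left-hand side of (12.29); the integral uses Simpson's rule
    with an even number of steps."""
    def g(t):
        return F(v * (alpha - 1.0 / t)) / t

    h = (v - u) / steps
    total = g(u) + g(v)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * g(u + i * h)
    return f(alpha * v) - (total * h / 3.0) / lam


def rhs(alpha, u, v, lam):
    """Right-hand side of (12.29)."""
    ratio = (v - 1.0 / alpha) / (u - 1.0 / alpha)
    bracket = math.log(alpha * v - 1.0) - math.log(ratio) / lam
    return 2.0 * math.exp(EULER_GAMMA) / (alpha * v) * bracket
```

With $\alpha = 1/2$, $u = 3$, $v = 8$, $\lambda = 2$ (the choice made later in Chapter 13) both sides agree to within the quadrature error, and the bracket on the right reduces to $\log 3 - \tfrac12 \log 6 = \tfrac12 \log\tfrac32$.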

For our use in the next chapter it suffices to have the specialization
of Theorem 12.1 to the sequences

\[ \mathcal{A} := \{|p + h| : p \le x\}, \qquad 2 \mid h, \quad 0 < |h| \le x, \tag{12.32} \]

where h is determined with respect to sufficiently large x, and with u 
and v restricted by d!2.28l ). Here note that for one has d!2.10t and 
R d = 0(E(x, d)) so that lemma M5\ (with k = 1) fulfills (R{l,a)) for 

\[ \alpha = \frac{1}{2}. \tag{12.33} \]

Hence we have, by Theorem 12.1 and Lemma 12.1, the required
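Theorem 12.2. Let $h$ denote an even integer (determined with respect
to $x$) satisfying

\[ 0 < |h| \le x. \tag{12.34} \]

Let $u$ and $v$ be two real numbers (independent of $x$) subject to

\[ 2 < u < v, \qquad 4 \le v \le 8 \tag{12.35} \]

and let

\[ 2 \le \lambda \in \mathbb{R}. \tag{12.36} \]

Then we have

\[ \sum_{\substack{p \le x \\ (p + h,\ \prod_{p' < x^{1/v},\ p' \nmid h} p') = 1}} \Bigl\{ 1 - \frac{1}{\lambda} \sum_{\substack{x^{1/v} \le p' < x^{1/u} \\ p' \mid p + h,\ p' \nmid h}} 1 \Bigr\} \ge 4\, \mathfrak{S}(h)\, \frac{x}{\log^2 x} \Bigl\{ \log\Bigl(\frac{v}{2} - 1\Bigr) - \frac{1}{\lambda} \log \frac{v - 2}{u - 2} + O\Bigl(\frac{1}{(\log x)^{1/15}}\Bigr) \Bigr\}, \tag{12.37} \]

where $\mathfrak{S}$ is defined by (9.42) and the $O$-constant depends at most on $u$
and $v$.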



NOTES

(12.3): Here and in the sequel one recognizes the effect of Bombieri's
theorem (and possible improvements, for instance, like the Elliott
and Halberstam conjecture mentioned in the notes of Chapter |^1 3) 
when used along with Selberg's sieve. 

Among the various weighted sieves introduced successfully (for
applications) we mention first Kuhn. Here the basic idea
consists in forming the $W$-function ((12.12)). Next we have Selberg's
weights of the form



^ {l - lD(a)}(2>) 2 

aesZ d\a 



(12.38) 
(with $\lambda_d$'s given by (9.28)). By taking $\mathcal{A} = \{n(n + 2) : n \le x,\ 2 \nmid n\}$ and
$D(a) = d(n) + d(n + 2)$ here, Selberg [2] (cf. Selberg) succeeded
in proving that

\[ n(n + 2) = P_5 \tag{12.39} \]

holds for infinitely many integers $n$. This method, in the case where
$D(a)$ has (apart from a term to take care of the 'small' prime divisors of
$a$) the form of the inner sum in Kuhn's $W$ (cf. (12.12)), has been
published by Miech, and Porter. Next, Ankeny and Onishi
have replaced Kuhn's constant weight $1/\lambda$, attached to the inner sum in
(12.12), by a logarithmic weight which is more effective and also has a
smoothing effect on the prime divisors in that sum, and this weight has
been generalized by Richert [1]. Both the Kuhn weight and the logarithmic
weight can also be used simultaneously in (12.38) (cf. Halberstam
and Richert [1] (Theorem 10.8)). In the first case, a generalization and
refinement of Selberg's second method (mentioned above in connection
with (12.39)) can be found in Bombieri [6] (§ 8). Bombieri [6] (§ 9)
(cf. [8]) has used this method, which is both elegant and comparatively
simple (though somewhat weaker than the other methods described
above), for the problem $p + 2 = P_4$. (For a more general result which can
be obtained by this method, see Halberstam and Richert [1] (Theorem
10.9).) Buchstab [2] has generalised Kuhn's idea of constant weights
by splitting up the inner sum in (12.12) into many parts and attaching
different constant weights to each part. This method is highly effective
but is, on the other hand, very complicated. Roughly speaking, it may
be described as splitting the inner sum into two parts and for one portion
attaching constant weights which approximate to the logarithmic weight
(thereby achieving a smoothing) while in the other portion the attached
weights approximate to a smoothing in the opposite direction.

Chen's ingenious idea, for which we refer to Chapter 13, leading to
an improvement in respect of some very prominent problems in additive
prime number theory, does not need any of the more sophisticated
weighted sieves described above, but just the very special Kuhn's sieve
(in the form stated in Theorem 12.2).

However, contrary to the other sieve methods mentioned above, this
method cannot be applied to a great variety of sieve problems. It can
be directly applied to the problems of the type

\[ N = p + P_2, \qquad p + h = P_2, \qquad ap + b = P_2 \tag{12.40} \]

(cf. Theorems 13.1, 13.2; the last one (cf. Theorem 10.2) according
to Halberstam (oral communication)). The problem of attacking other
related problems by this method has not yet been tackled (cf. Notes of
Chapter 13), and in this context the logarithmic weight procedure still
gives the best results known to date. In case of dimension $\kappa = 1$ we
mention (Richert [1]):

Let $F(n)$ be an irreducible polynomial of degree $g\ (\ge 1)$ with integer
coefficients. Let $\rho(p)$ denote the number of solutions of the congruence

\[ F(m) \equiv 0 \bmod p, \tag{12.41} \]



and suppose that 






p{p) < p for all p. 



(12.42) 



Then, we have 




(12.43) 



and if further 



p{p) < p - 1 for p \ F{0) and p < g + 1, 



(12.44) 



then (excluding the case F(n) = +n) we also have 




(1- 



p(p)-i) 




(12.45) 
Hence, in particular (if lim = 0), there are infinitely many

natural numbers $n$ such that

\[ F(n) = P_{g+1}, \tag{12.46} \]

and also infinitely many primes $p$ such that

\[ F(p) = P_{2g+1}. \tag{12.47} \]

(Note that for (12.47) there is no need to exclude the case $F(n) = \pm n$ as
was done for (12.45).)

The corresponding problems for polynomials in two variables
$F(m, n)$ can be handled more successfully. In this connection we refer, for
instance, to the papers of Greaves fTl. P. Iwaniec 0, 0, [5], [6], and 
Huxley and Iwaniec Q. 

Regarding the question of almost primes in arithmetic progressions, 

i.e., 

P,- = lmodk,(l,k) = I, (12.48) 



Motohashi has proved, by averaging the logarithmic weights of Ric- 
hert [ 1 ], that there is a 

P 2 <k 1A , (12.49) 

and that there is a 

P 3 <klog 70 k (12.50) 

for almost all k and also the corresponding results valid for almost all 
£modk(k -> oo) (Motohashi O, (T^L (T3J). Without any exceptions 
we have only the existence of 

P 2 «£ 2 - 2 - (12.51) 
P 3 « k nn (12.52) 

and 

P r « k '-i for r > 2. (12.53) 

in (12.48). (Actually, here the exponents can be replaced by slightly
smaller ones.) (Richert, cf. Halberstam and Richert [1] (Theorem
9.6)). These results should be compared with the corresponding ones for primes
(cf. the remark following (6.55)).

For more details and various other applications of weighted sieves
we refer to Halberstam and Richert [1] (Chapters 9 and 10).

As to the literature pertaining to the small sieves in general we refer
to the extensive list of references given in Halberstam and Richert [1].
We take this opportunity to add the following list of papers, which are
neither included there nor are mentioned in these lectures so far:

Bombieri, Buchstab, Elliott, Hall, Hooley,
Meijer, Ramachandra, Scourfield, Wolke.

Lastly, we take up now again (cf. Notes of Chapters 00 and 01 the 
question of comparison of the large sieves and the small sieves. The 
importance of the large sieves for applications in analytic number the- 
ory should be clear from the exposition in the Chapters through 
and it can hardly be overestimated. Also in arithmetical questions the
large sieve turns out to be powerful when applied, in an auxiliary
capacity, along with 'small' sieves, not only via the Bombieri-type
results of Chapter 3 (cf. our condition $(R(1, \alpha))$, (11.62), and under
(12.3) in these notes above) but also in a wider sense as is reflected in
the proof of Chen's theorem (cf. Theorem 10.3 and the beginning of
the notes for Chapter 13). It is only in its arithmetical version (cf. Chapter
that the large sieve, even in its most powerful form (namely, the 
weighted Montgomery- Vaughan sieve) suffers from some deficiencies 
when compared with Selberg's sieve (in particular, when compared with 
the weighted form of the latter). Without repeating our remarks, made 
in the Notes of Chapter00and0 we mention only the following facts: 
The large sieve (for example, Theorem 7.1) can be used (as it is) to
obtain only upper bounds, while Buchstab's method (cf. Lemma 11.1)
provides (at least in principle) a corresponding lower bound; however,
for $\frac{\log N}{\log z} > 2$, Theorem 11.13 (for $X = N$) gives a better estimate than
does Theorem 7.1. Therefore, a suitably iterated form of (7.9) should
be investigated. With respect to the function $\omega(p)$ the large sieve has the
decisive advantage over Selberg's sieve in that Theorem 7.1 imposes
no restriction on the order of magnitude of $\omega(p)$, so in particular $\omega(p)$
need not be bounded on the 'average' as required by our condition $(\Omega_2)$
of the small sieve. However, the large sieve cannot deal with the very
important case $\omega(p) = \frac{p}{p-1}$, for example with the problem

\[ F(p) = P_r. \tag{12.54} \]

Here, the defect stems from the fact that the large sieve basically
requires that $n$ has to run through a sequence of consecutive integers.
Thus the large sieve, while applicable to the problem

\[ F(n) = P_r \tag{12.55} \]

(leading to a trivial estimate when applied directly to sieve the sequence
$\{F(p)\}$), requires that one has to sieve the sequence $\{nF(n)\}$ for the
problem (12.54), but then (cf. our remarks following (11.14)) the constant in
the upper estimate is worsened by a factor 2.



Chapter 13 



On Goldbach's Conjecture 
and Prime-Twins 

NOW WE have prepared the ground for proving the main results of
(the second part of) these lectures. These results represent the best
approximation (in some sense) to the two most prominent problems in the
additive theory of prime numbers (namely, those mentioned in the title
of this chapter). The first proof of these results is due to Chen [1] (for
simplifications see Ross, cf. Halberstam and Richert [1] (Chapter
11)). Here we shall give a proof with further simplifications, in respect
of (numerical) calculations and, more specifically, involving only
elementary functions. Actually this is accomplished at the expense of the
quality of the constants ($c_0$) occurring in these results, which does not
however affect the qualitative statement of these results. For the proof
we use the special form (as given in Theorem 12.2) of Kuhn's sieve and,
instead of the earlier practice of translating this theorem into the
language of additive number theory, follow the idea of Chen by subtracting
an additional term. Then, as one would expect, we reformulate the problem
for an application of Theorem 10.3, thereby completing the proof.



We start with (cf. d 12.37b . (fTTHoT O 
G(N):= £ {1-1 £ _i 2 -(13-D 

p<N-\ N lls <p'<N [/3 PiP2P3<N 

(N-p, n P')=l p'\N-p N lls <pi<N 1/3 <p 2 P3 

p'<tfl/8 N~p=p[P2P3 

As we shall see later, $G(N) > 0$ implies (in the direction of Goldbach's
conjecture) that

\[ N = p + P_2 \tag{13.2} \]

is soluble. One can even obtain a lower bound for $G(N)$ (when $N$ is even
and large) in terms of $N$, by means of Theorems 12.2 and 10.3. Indeed,
with suitable choices for the parameters in these theorems, namely

\[ x = N, \quad 2 \mid N \in \mathbb{N}, \quad v = 8 \tag{13.3} \]

in both of them and further

\[ h = -N, \quad u = 3, \quad \lambda = 2 \tag{13.4} \]

in Theorem 12.2, while in Theorem 10.3

\[ h = N, \tag{13.5} \]

one obtains an estimate for an expression similar to $G(N)$. Actually,
then one would have

Z {1-5 Z 1-3 Z > 

P<N N 1/s <p'<N [/3 PlP2P3<N 

(N-p, n p'\N~p N lli < Pl <N lli <p 2 <P3 

p'KN 1 '* J ' p'JfN N-p=p lP 2P3 

p'fN 

> 4S(A0^{log 3 - \ log 6 - ±c(8) + 0(log K)~^ ). 

(13.6) 

Now a comparison of the left-hand side here with G(N) shows that 
we need the estimate 

£ X i«iog^+ 2 ^ <<A ^ 7/8 - 

' N-p=0modp u 

(13.7) 
which provides a bound for the (difference in) contributions to these
expressions arising from (the possible $p = N - 1$,) the numbers $N - p$
satisfying either of the conditions

\[ (N - p, N) > 1 \quad \text{and} \quad \mu(N - p) = 0. \tag{13.8} \]

Hence we have for $G(N)$ also the same lower estimate given by the
right-hand side of (13.6). Since, by (10.79) with $v = 8$,

a{ log 3 - \ log 6 - \c(S)\ = 2 log I - 2c(8) > 2 log 



2 ° 2 v 7 J °2 
3 3 64 1 3 7 

log— (>-log -=;c )>0, (13.9) 

2 2 5 39 2 5 2.10 3 

we see that one has (for instance) the lower bound

\[ G(N) \ge c_0\, \mathfrak{S}(N) \frac{N}{\log^2 N} \quad \text{for } N \ge N_0, \tag{13.10} \]

where $N_0$ is some absolute constant.

Next, we elaborate on the remark above pertaining to (13.2). To
begin with, observe that $G(N)$ does not exceed the part, $G^*(N)$ say, of
its defining sum comprising only the positive (i.e., $> 0$) terms.
And also note that any term of (13.1) with its second inner sum at least
1 makes the first sum, accompanying it, $\ge 1$ so that such a term is $\le 0$.
Thus we see that the terms occurring in $G^*(N)$ have their second
inner sum empty and the first one is at most 1. In other words, one has (since
$\mu(N - p) \ne 0$)

'G{N) < G*(N) < \\{p <N:N-p = p', 
ATf <p' <N 1 3}U{p<N:N-p = p'p 2 , 

N l/H < p' <N l 3 < p 2 } + \{p < N : N - p = p u ' ' 

P\ > N?\ U {p < N : N - p = p\P2,N l i < pi < pi}\ 

and also

\[ G(N) \le |\{p < N : N - p = P_2\}|. \tag{13.12} \]

From (13.10), (13.12) we obtain the desired






Theorem 13.1. There is an absolute constant $N_0$ such that for all even
numbers $N \ge N_0$, we have

\[ |\{p < N : N - p = P_2\}| \ge c_0\, \mathfrak{S}(N) \frac{N}{\log^2 N} \quad (c_0 > 0), \tag{13.13} \]

where $c_0$ may be taken as the constant defined in (13.9); in particular, there is
always a solution of the equation

\[ N = p + P_2 \quad \text{if } 2 \mid N \text{ and } N \ge N_0. \]
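The passage from the weighted sum $G(N)$ to the count in (13.12) can be illustrated by brute force for small even $N$. The sketch below is not the definition (13.1) verbatim: it assumes both inner sums carry the weight $\tfrac12$ (corresponding to $\lambda = 2$), restricts to squarefree $N - p \ge 2$, and compares primes with $N^{1/8}$ and $N^{1/3}$ through exact integer powers. The function names are hypothetical.

```python
def prime_factors(n):
    """Prime factors of n >= 2 with multiplicity, by trial division."""
    factors, d = [], 2
    while d * d <= n:
        while n % d == 0:
            factors.append(d)
            n //= d
        d += 1
    if n > 1:
        factors.append(n)
    return factors


def is_prime(n):
    return n >= 2 and prime_factors(n) == [n]


def chen_weight_sum(N):
    """Return (2 * G(N), #{p < N - 1 : N - p = P_2}) for the assumed
    Chen-type weights 1 - (1/2)*first - (1/2)*second."""
    twice_G = 0
    p2_count = 0
    for p in range(2, N - 1):                 # keeps N - p >= 2
        if not is_prime(p):
            continue
        fs = prime_factors(N - p)
        if len(fs) <= 2:
            p2_count += 1                     # N - p is a P_2
        if len(set(fs)) != len(fs):           # N - p not squarefree
            continue
        if any(q ** 8 < N for q in fs):       # prime factor below N^(1/8)
            continue
        middle = [q for q in fs if q ** 3 < N]    # N^(1/8) <= q < N^(1/3)
        large = [q for q in fs if q ** 3 >= N]    # q >= N^(1/3)
        first = len(middle)
        # N - p = p1*p2*p3 with p1 in the middle range and p2 < p3 large
        second = 1 if (len(middle) == 1 and len(large) == 2) else 0
        twice_G += 2 - first - second
    return twice_G, p2_count
```

Every term with positive weight has either no prime factor in the middle range (so at most two prime factors, all at least $N^{1/3}$) or exactly one such factor and at most one large factor, which is why `twice_G <= 2 * p2_count` always holds for this sketch. Chen's theorem is the statement that the left side is in fact large.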

The proof of the corresponding result regarding the generalized 
prime-twins proceeds analogously. Now we choose again (cf. (113.31 ) 

x,2\h,0<\h\<xKv = 8 (13.14) 
in both the Theorems 1 1 2. 21 [HOI and further 

M = 3,v = 8,,l = 2 (13.15) 
in Theorem 112.21 Also we note that the sum £ on the left-hand side 

p<x 

of (112.37b can be replaced by £ (cf. (113. 14t ) apart from a negligible 

p+h<x 

error of the order 0(x?). Lastly, by dl3. 14l >. we observe that the p's 
counted in ( 110.161 satisfy (irrespective of the sign of h) p + h < x and 
also p + h = pip 2 P3- 

Now, by the choices 013.14I ). ( 113.151 for the parameters, it follows 
from Theorems 1 1 2. 21 and l 1 0. 31 (corresponding to (U3.6t ) 

£ fi-j 2 1-5 Z 1} 

p+h<x y jc 1/8 <p'<.v 1/3 PlP2P3<* ' 

(p+h, n P'), p'\p+h x l l s <pi<x l l 3 <p 2 < P3 

p' <x Ui p/J(h p+h=pip 2 p3 (13.16) 

p'fh 

> 4S(/0 i ^{llog I - ic(8) + <9((logx)-*}. 
From here, on using the argument of d!3.7t and d!3.9t . one obtains 






(for the analogue of G(N) the lower bound 

T h (x)= 2 {l-i 2 l-i 2 l) 

/>+/i<.\- l x 1/8 <p'<,r 1/3 PlP2Pl<* > 

(p+h, n />'), p>+/! A 1/8 <pi<X 1/3 <p2<P3 

p'<A- 1/8 P+h=pip 2 p3 
H(p+h)+Q 

> co&(h)-A- for x > ^o- 

V log x 

(13.17) 

Again the reasoning leading to (13.11), and so also to (13.12), is
applicable with $p + h$ in place of $N - p$. Thus one arrives at the final

Theorem 13.2. There is an absolute constant xq such that for any even 
number h (determined with respect to x) satisfying 

Q<\h\<x^, (13.18) 

we have

\[ |\{p + h \le x : p + h = P_2\}| \ge c_0\, \mathfrak{S}(h) \frac{x}{\log^2 x} \quad \text{for } x \ge x_0\ (c_0 > 0), \tag{13.19} \]

where $c_0$ (again) may be taken as the constant defined in (13.9); in
particular, for any non-zero even number $h$, there are infinitely many primes
$p$ such that

\[ p + h = P_2. \tag{13.20} \]



NOTES 

Chen's theorem affords a beautiful instance of the effective use of
various powerful tools of number theory. As an inspection of its proof
(and those of Theorems 12.2 and 10.3) discloses, we have employed
Kuhn's sieve, Selberg's sieve (several times), Bombieri's prime number
theorem, the Siegel-Walfisz theorem, contour integration, the hybrid form
of the large sieve and Chen's new idea described below.

Chen's idea, which has already been briefly indicated at the beginning
of this chapter, can be described as follows. (We shall confine ourselves,
for this purpose, to Theorem 13.1.) One first sifts the sequence
$N - p$ so that the remaining numbers satisfy

\[ N - p = P_3 \tag{13.21} \]
(and then the number of these remaining ones is estimated from below,
cf. Theorem 12.2). Now to remove from the rest those which are of the
form $p_1 p_2 p_3$, one subtracts (from the preceding lower bound) another
sieve estimate (from above) for the number of solutions of

\[ N - p_1 p_2 p_3 = p \tag{13.22} \]

(cf. Theorem 10.3). If, as has been shown to be the case for the problems
under consideration, the lower bound exceeds the upper estimate (for
(13.22)), it follows then that (since the surviving members of $\{N - p\}$
are all now $P_2$'s) there must be solutions of the equation

\[ N = p + P_2. \tag{13.23} \]

This procedure of inverting the equation (13.21) to (13.22) has
turned out to be much more fruitful than any further known improvement
of the sieve method one has started with. Other successful applications
of this last step have been given by Indlekofer [2], Huxley and
Iwaniec.

In this context we recall the remarks preceding (12.40). To briefly
expand on that statement, we mention that to attempt an improvement
upon (12.46) (or (12.47)) by Chen's method one would require (cf.
(13.21) and (13.22)), limiting ourselves now to the simplest case, for

\[ n^2 + 1 = P_2 \tag{13.24} \]

a satisfactory upper bound for the number of solutions of

\[ p_1 p_2 p_3 - 1 = n^2 \tag{13.25} \]

with $p_i$'s restricted by some conditions (like those in Theorem 10.3).
Surprisingly we do not know any method of obtaining a satisfactory estimate
for the number of squares in a sequence under appropriate conditions,
and specifically not even in this case. An explanation would clearly be
that the sequence of squares, even though more regularly distributed,
is much thinner than the sequence of primes. This also indicates that
the corresponding problem with respect to (12.46) (or (12.47)) is much
more complicated.

Theorems 13.1 and 13.2: It is possible to state Theorem 13.1 in a
more precise form with respect to the prime factors of $P_2$, as can be seen
from a combination of (13.10) and (13.11). A similar remark applies
to Theorem 13.2 also.

Regarding the constant $c_0$ (cf. (13.13), (13.19)) in these theorems
we note that our value of

\[ c_0 > 0.0446 \tag{13.26} \]

(cf. (13.9)) is rather small. One of the reasons for this is the choice
$v = 8$, which was made to enable us to deal (cf. (11.73), (11.74)) with
elementary functions only (and also to simplify numerical calculations),
as has been mentioned in the introduction to this chapter, a convenience
not available under the better choice (of all earlier proofs) of $v = 10$. Of
course, we can replace (13.26) by (cf. (13.9))

co > 2 log I - I log ^ > 0.0679, (13.27) 

but a better constant can be obtained by taking $v = 10$. With this latter
choice, Chen [1] obtained

\[ c_0 > 0.3354 \tag{13.28} \]

and by numerical integration one can even get

\[ c_0 > 0.3445 \tag{13.29} \]

(see Halberstam and Richert [1] (p. 338)). It turns out that $v = 11$ is
close to the optimal choice, by considering the non-elementary functions
$F$, $f$ in a wider range, and then one is led to

\[ c_0 > 0.3716. \tag{13.30} \]

In all these cases $u$ is kept fixed to be 3, a value convenient in the
arithmetical interpretation of the estimates of the weighted sieve. The
constant $c_0$ can further be slightly improved by taking $u > 3$ but then
Chen's procedure becomes more complicated. As to the constant $c_0$
under consideration it is worthwhile to compare it with 1, in view of the
conjecture of Hardy and Littlewood mentioned (under Theorem 10.1)
in the Notes for Chapter 10. This suggests that our present constant $c_0$
should be capable of much further improvement. In this context, we add
that one has theorems (corresponding to Theorems 13.1 and 13.2) with
$P_2$'s in (13.13) and (13.19) replaced by $P_3$ for a better (corresponding)
constant

13 

co > y (13.31) 

(cf. Halberstam and Richert [1] (Theorem 9.2)). Comparing the methods
of proof it is considered to be likely that the constant $c_0$ in Chen's
theorem can be further improved by using the logarithmic weights
instead of Kuhn's weights.

Continuing on with related questions we mention now a few results 
concerning Goldbach numbers (namely, those even numbers N which 
can be written as sums of two primes) 

N = p + p' (2\N). (13.32) 

Ramachandra [4] has derived from an estimate of (16. 1 It -type, actu- 
ally from (the uniform) 

2 N(cr, T, X ) « {q 2 Tf l - <T) \og u (qT) for 1 < <r < 1, (13.33) 

x 

that the number of Goldbach numbers in the interval

x<N<x + x A (13.34) 

has the asymptotic formula 

\x A + AA (x A log~ A x) as x -> 00, if (1 >)/l > 1 - -, (13.35) 
2 g 

and has also deduced from (13.35), by combining with a result of
Montgomery, that (for $x \ge x_0$) there is always a prime $p$ in the interval
(13.34) such that both $p + 1$ and $p - 1$ are Goldbach numbers.

Many results have been proved (in various forms) in order to show
that 'almost all' (with respect to the error-term corresponding to that of
(13.35)) even numbers are Goldbach numbers. The best result known
here, up to this time, is due to Montgomery and Vaughan [3] and states
that the number of even integers $N \le x$ which are not Goldbach numbers
is

\[ \ll x^{1 - \delta}, \quad \text{for some } \delta > 0. \tag{13.36} \]

(For a previous result, see Vaughan [2].)

Further, turning to problems allied to Theorem 13.2 we have (cf.
(12.40)) the observation of Vaughan [3] which yields, when combined
with Chen's method, the following result: Either the equation

\[ 2p + 1 = p' \tag{13.37} \]

has infinitely many solutions or

\[ 2p + 1 = p_1 p_2 \tag{13.38} \]

has infinitely many solutions, in which latter event one has, in particular,
infinitely many solutions of the equation

\[ d(n + 1) = d(n). \tag{13.39} \]

This statement concerning (13.39) is a conjecture of Erdős and
Mirsky (cf. Halberstam and Richert [1] (p. 338)).
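The step from (13.38) to (13.39) is elementary: if $p$ is an odd prime and $2p + 1 = p_1 p_2$ with $p_1 \ne p_2$, then $n = 2p$ satisfies $d(n) = d(n + 1) = 4$. The sketch below checks this for small $p$; taking $p$ odd and $p_1 \ne p_2$ is an assumption made explicit here, and the function names are illustrative.

```python
from collections import Counter


def prime_factors(m):
    """Prime factors of m >= 2 with multiplicity, by trial division."""
    factors, d = [], 2
    while d * d <= m:
        while m % d == 0:
            factors.append(d)
            m //= d
        d += 1
    if m > 1:
        factors.append(m)
    return factors


def divisor_count(n):
    """Number of positive divisors d(n) from the prime factorisation."""
    result = 1
    for exponent in Counter(prime_factors(n)).values():
        result *= exponent + 1
    return result


def semiprime_shifts(limit):
    """Odd primes p <= limit with 2p + 1 = p1 * p2 for distinct primes."""
    found = []
    for p in range(3, limit + 1):
        if len(prime_factors(p)) != 1:        # p is not prime
            continue
        fs = prime_factors(2 * p + 1)
        if len(fs) == 2 and fs[0] != fs[1]:
            found.append(p)
    return found
```

For every `p` returned by `semiprime_shifts`, `divisor_count(2 * p)` and `divisor_count(2 * p + 1)` both equal 4, so each solution of (13.38) of this shape gives a solution $n = 2p$ of (13.39).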

Lastly we mention an application, due to Jutila, to a question allied
to Theorem 13.2. He deduced from his result (6.26), by combining
it with a theorem of Levin [2] (reference to Richert [1] would permit
to replace '8' by '7' in the following statement), that for every integer
$r \ge 8$ there exists a number $\theta(r)$ satisfying (with $c$ as in (6.28))

2 + 4c 

(and 9{r) decreasing to C* for increasing r), such that 

\[ x \le p \le x + x^{\theta(r)}, \quad p + 2 = P_r, \quad \text{for all } x \ge x_0, \tag{13.41} \]

is soluble. Also he stated that a similar result can be derived for (an
almost-) Goldbach problem, which may be interpreted as that the equation

\[ N = p + P_r, \quad N \ge N_0, \quad 2 \mid N, \quad r \ge 8, \tag{13.42} \]

has a solution in two 'almost equal' (in a sense which is stronger for
larger $r$) numbers $p$, $P_r$.